Concept

Example 3: Social Science

A country conducts a census survey, including socio-demographic questions (age, gender, education, profession, number of children, salary, etc.). The question is to determine which factor influences which other factor.

The dataset consists of joint answers from nn citizens for factor pairs (Xk,Yk),k=1,N:Sk={(xk1,yk1),,(xkn,ykn}( X_k , Y_k ), k = 1, ⋯ N : S_k = \{( x_{k 1} , y_{k 1} ), ⋯ , ( x_{kn} , y_{kn} \} . Prior knowledge allows us to determine the ground truth for a few pairs (Xk,Yk)( X_k , Y_k ) (for example age can influence wages and not vice-versa).

This constitutes an empirical “mother distribution” of pairs {(S1,g1),,(Sk,gk),,(SN,gN)}\{( S_1 , g_1 ), ⋯ , ( S_k , g_k ), ⋯ , ( S_N , g_N )\} from which we might train a classifier to label other similar pairs with their causal direction.

0

1

Updated 2020-07-14

Tags

Data Science

Related