Concept

Causation Coefficient

Consider a pair of random variables (X,Y)( X , Y ) with ground truth G{XY,XY,XY,XY}G ∈\{ X → Y , X ← Y , X \leftrightarrow Y , X ⊥ Y \} , drawn from an unknown mother distribution. We call an estimator G^\hat{G} of GG based on (X,Y)( X , Y ).

The authors define a causation coefficient as follows: A causation coefficient C(X,Y)C ( X , Y ) is a real scalar value, such that the larger C(X,Y)C ( X , Y ), the more confident we are that G=XYG = X → Y . A causation coefficient must have the following desired properties:

  1. [Anti-symmetry] C(X,Y)=C(Y,X)C(X,Y) = -C(Y,X)
  2. [Discriminant] C(X,Y)>θ0G^=[XY]C(X,Y)>\theta \leq 0 \Rightarrow \hat{G} = [X\rightarrow Y] (and G^[XY]\hat{G} \neq [X\rightarrow Y] otherwise)
  3. [Arbitrary units] C(aX+b,cY+d)=C(X,Y);a,b,c,dR,a0,c0C(aX+b, cY+d) = C(X,Y); a,b,c,d\in R, a\neq 0, c\neq 0

0

1

Updated 2020-07-28

Tags

Data Science