Concept

Interpretation for Statistical Significance Test Formula

δ(x)\delta (x) is the performance difference between A and B on a particular test set x. This performance can be accuracy or F-measure. A bigger δ means that A seems to be way better than B; a small δ means A seems to be only a little better.

δ(X)\delta (X) is the performance difference between A and B on X, which ranges over all test sets.

H0H_{0} is the null hypothesis, meaning that A is not better than B.

p-value is the probability, assuming the null hypothesis H0H_{0} is true, of seeing the δ (x) that we saw or one even greater.

0

2

Updated 2021-10-24

Tags

Data Science