Interpretation for Statistical Significance Test Formula
δ(x) is the performance difference between systems A and B on a particular test set x, where performance is measured by a metric such as accuracy or F-measure. A large δ(x) suggests that A is much better than B; a small δ(x) suggests that A is only slightly better.
δ(X) is the performance difference between A and B on X, a random variable that ranges over all test sets.
H0 is the null hypothesis, which states that A is not better than B.
The p-value is the probability, assuming the null hypothesis is true, of observing the δ(x) that we saw or an even larger one.
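The p-value definition above can be made concrete with a small sketch. The code below is not taken from this card: it assumes accuracy as the metric and uses a paired randomization test, one common way to estimate the p-value, in which each test item's A and B results are randomly swapped to simulate a world where the two systems are interchangeable. That is a stricter null than "A is not better than B", so treat it as an approximation. The function names and toy data are hypothetical.

```python
import random


def accuracy(correct_flags):
    """Fraction of test items a system got right (flags are 0 or 1)."""
    return sum(correct_flags) / len(correct_flags)


def delta(a_correct, b_correct):
    """delta(x): accuracy of A minus accuracy of B on the same test set x."""
    return accuracy(a_correct) - accuracy(b_correct)


def randomization_p_value(a_correct, b_correct, trials=10000, seed=0):
    """Estimate the p-value by randomly swapping A's and B's result on each item.

    Assumption: under the null, A and B are interchangeable on every item,
    so swapping their results yields an equally likely outcome.
    """
    rng = random.Random(seed)
    observed = delta(a_correct, b_correct)
    at_least_as_large = 0
    for _ in range(trials):
        shuffled_a, shuffled_b = [], []
        for a, b in zip(a_correct, b_correct):
            if rng.random() < 0.5:
                a, b = b, a  # swap which system gets credit for this item
            shuffled_a.append(a)
            shuffled_b.append(b)
        # Small tolerance guards against floating-point rounding in the comparison.
        if delta(shuffled_a, shuffled_b) >= observed - 1e-12:
            at_least_as_large += 1
    # Add-one smoothing keeps the estimate strictly above zero.
    return (at_least_as_large + 1) / (trials + 1)


# Toy per-item results (1 = correct, 0 = wrong); purely illustrative data.
system_a = [1] * 18 + [0] * 2
system_b = [1] * 12 + [0] * 8
print("delta(x):", delta(system_a, system_b))
print("estimated p-value:", randomization_p_value(system_a, system_b))
```

A small estimated p-value means that a difference as large as the observed δ(x) would rarely arise if A were not actually better than B, which is the evidence used to reject the null hypothesis.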
Updated 2021-10-24
Tags
Data Science