Learn Before
Concept

Pointwise Mutual Information

Pointwise Mutual Information (PMI) is a measure of how often two events occur, compared with what we would expect if they were independent.

An alternative weighting function to tf-idf, positive pointwise mutual information is used for term-term-matrices, when the vector dimensions correspond to words rather than documents.

(PMI(w,c)=log2(P(w,c)/P(w)P(c))(PMI(w,c) = \log _{2} (P(w,c)/P(w)P(c))

(PPMI(w,c)=max(log2(P(w,c)/P(w)P(c)),0)(PPMI(w,c) = \max(log _{2} (P(w,c)/P(w)P(c)),0)

0

1

Updated 2022-06-29

Tags

Data Science