Learn Before
Code

Interpolated Kneser-Ney Smoothing for Bigrams

PKN(wiwi1)=max(C(wi1wi)d,0)C(wi1)+λ(wi1)PCONTINUATION(wi)P_{KN}(w_i|w_{i-1}) = \frac{max(C(w_{i-1}w_i) - d, 0)}{C(w_{i-1})}+\lambda(w_{i-1})P_{CONTINUATION}(w_i)

Where λ is a normalizing constant to distribute probability mass:

λ(wi1)=dvC(wi1v){w:C(wi1w)>0}\lambda(w_{i-1}) = \frac{d}{\sum_vC(w_{i-1}v)}|\{w:C(w_{i-1}w)>0\}|

0

1

Updated 2022-06-29

Tags

Data Science