Noisy channel algorithm for real-word correction

The algorithm takes the input sentence $X = \{x_{1}, x_{2}, \ldots, x_{k}, \ldots, x_{n}\}$ and generates a set of candidate words $C(x_{i})$ for each word $x_{i}$. The candidates include every word with a small edit distance from $x_{i}$. Then we pick the candidate sentence with the highest language model probability.
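As a rough illustration of the candidate step, the sketch below builds $C(x_{i})$ from a toy vocabulary. It assumes "small edit distance" means at most one insertion, deletion, substitution, or transposition, and that the word itself is always kept as a candidate. The names `edits1`, `candidates`, and `VOCAB` are hypothetical and chosen only for this example.

```python
import string

# Hypothetical toy vocabulary; a real system would use a large dictionary.
VOCAB = {"two", "of", "off", "the", "thew", "threw", "to", "too", "on", "tho", "thaw"}


def edits1(word):
    """Return all strings one edit away from `word` (assumed edit radius: 1)."""
    letters = string.ascii_lowercase
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [left + right[1:] for left, right in splits if right]
    transposes = [
        left + right[1] + right[0] + right[2:]
        for left, right in splits
        if len(right) > 1
    ]
    replaces = [left + c + right[1:] for left, right in splits if right for c in letters]
    inserts = [left + c + right for left, right in splits for c in letters]
    return set(deletes + transposes + replaces + inserts)


def candidates(word, vocab=VOCAB):
    """C(x_i): the word itself plus every vocabulary word one edit away."""
    return {word} | (edits1(word) & vocab)


print(sorted(candidates("thew")))
print(sorted(candidates("of")))
```

Keeping the original word in its own candidate set matters for real-word correction, because most words in the input are not errors and the unchanged sentence has to compete with the corrected ones.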

Each sentence is scored by the noisy channel:

[Image: noisy channel scoring equation]
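In its usual textbook form, the noisy channel score picks $\hat{W} = \operatorname{argmax}_{W \in C(X)} P(X \mid W)\, P(W)$, where $P(W)$ is the language model and $P(X \mid W)$ is the channel model. The sketch below is one minimal way to apply that score, not a definitive implementation. It makes several assumptions that are not stated above: each sentence contains at most one error, a correctly typed word has channel probability `ALPHA`, the remaining mass is split evenly over the other candidates, and the language model is a toy bigram table with a floor for unseen pairs. All names and numbers (`CANDIDATES`, `BIGRAM`, `ALPHA`, `FLOOR`) are hypothetical.

```python
import math

ALPHA = 0.95   # assumed P(x | w) when the typed word equals the intended word
FLOOR = 1e-6   # assumed probability for bigrams missing from the toy table

# Hypothetical candidate sets C(x_i); each set includes the typed word itself.
CANDIDATES = {
    "two": {"two", "to", "too"},
    "of": {"of", "off", "on"},
    "thew": {"thew", "the", "threw", "thaw"},
}

# Hypothetical bigram probabilities P(w_i | w_{i-1}); "<s>" marks sentence start.
BIGRAM = {
    ("<s>", "two"): 0.01, ("<s>", "to"): 0.02, ("<s>", "too"): 0.001,
    ("two", "of"): 0.2, ("to", "of"): 0.001, ("too", "of"): 0.001,
    ("two", "off"): 0.001, ("two", "on"): 0.01,
    ("of", "the"): 0.3, ("of", "thew"): 1e-7,
    ("of", "threw"): 1e-5, ("of", "thaw"): 1e-5,
}


def channel_logprob(x, w, cands):
    """log P(x | w): ALPHA if unchanged, else the rest shared over other candidates."""
    if x == w:
        return math.log(ALPHA)
    return math.log((1 - ALPHA) / (len(cands) - 1))


def lm_logprob(words):
    """log P(W) under the toy bigram model."""
    total, prev = 0.0, "<s>"
    for w in words:
        total += math.log(BIGRAM.get((prev, w), FLOOR))
        prev = w
    return total


def candidate_sentences(words):
    """Yield the input sentence and every variant with exactly one word replaced."""
    yield list(words)
    for i, x in enumerate(words):
        for w in CANDIDATES.get(x, {x}):
            if w != x:
                yield words[:i] + [w] + words[i + 1:]


def correct(words):
    """Return the candidate sentence W maximizing log P(X | W) + log P(W)."""
    def score(cand):
        channel = sum(
            channel_logprob(x, w, CANDIDATES.get(x, {x}))
            for x, w in zip(words, cand)
        )
        return channel + lm_logprob(cand)
    return max(candidate_sentences(words), key=score)


print(correct(["two", "of", "thew"]))
```

Working in log space avoids underflow when many small probabilities are multiplied. Restricting candidates to one replaced word per sentence keeps the search linear in sentence length instead of exponential.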

Updated 2022-05-13

Tags

Data Science