Learn Before
Relation

The noisy channel algorithm

  • The first stage of the algorithm proposes candidate corrections by finding words that have a similar spelling to the input word. Analysis of spelling error data has shown that the majority of spelling errors consist of a single-letter change and so we often make the simplifying assumption that these candidates have an edit distance of 1 from the error word. To find this list of candidates we’ll use the minimum edit distance algorithm introduced in Chapter 2, but extended so that in addition to insertions, deletions, and substitutions, we’ll add a fourth type of edit, transpositions, in which two letters are swapped. The version of edit distance with transposition is called Damerau-Levenshtein edit distance.
Image 0

0

1

Updated 2022-05-14

Contributors are:

Who are from:

Tags

Data Science