Huge Language Models
Efficiency is a key consideration when building language models over very large collections of n-grams. Common techniques include:
- Storing words in memory as 64-bit hashes rather than as strings.
- Pruning: storing only n-grams whose counts exceed some threshold.
- Building approximate language models, for example with Bloom filters.
- Using stupid backoff: instead of computing a properly discounted probability, scoring an unseen n-gram by backing off to the next-lower order with a fixed weight (a sketch combining several of these techniques follows this list).
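As a concrete illustration, here is a minimal Python sketch combining three of these techniques: words reduced to 64-bit hashes, count-threshold pruning, and stupid-backoff scoring with the fixed weight 0.4 reported by Brants et al. (2007). The class and function names are hypothetical, and this is a toy under those assumptions, not a production n-gram store.

```python
# Toy sketch: 64-bit word hashing + count pruning + stupid backoff.
# Names (HashedNgramModel, word_hash) are illustrative, not a real API.
import hashlib
from collections import defaultdict


def word_hash(word: str) -> int:
    """Map a word to a 64-bit integer so the string itself is never stored.
    Truncating the digest can (rarely) collide; huge-LM systems accept this."""
    return int.from_bytes(hashlib.sha1(word.encode("utf-8")).digest()[:8], "big")


class HashedNgramModel:
    def __init__(self, max_order: int = 3, prune_threshold: int = 0, alpha: float = 0.4):
        self.max_order = max_order
        self.prune_threshold = prune_threshold  # keep only counts > threshold
        self.alpha = alpha                      # fixed stupid-backoff weight
        self.total_tokens = 0
        self.counts = defaultdict(int)          # tuple of word hashes -> count

    def train(self, tokens: list[str]) -> None:
        """Count every n-gram of order 1..max_order over the hashed tokens."""
        hashed = [word_hash(t) for t in tokens]
        self.total_tokens += len(hashed)
        for n in range(1, self.max_order + 1):
            for i in range(len(hashed) - n + 1):
                self.counts[tuple(hashed[i:i + n])] += 1

    def prune(self) -> None:
        """Drop every n-gram whose count does not exceed the threshold."""
        self.counts = defaultdict(
            int, {ng: c for ng, c in self.counts.items() if c > self.prune_threshold}
        )

    def score(self, context: list[str], word: str) -> float:
        """Stupid backoff: relative frequency of the longest observed n-gram,
        multiplied by alpha once per order we back off. Scores are not
        normalized probabilities."""
        ngram = tuple(word_hash(w) for w in list(context) + [word])[-self.max_order:]
        weight = 1.0
        while ngram:
            count = self.counts.get(ngram, 0)
            if count > 0:
                ctx = ngram[:-1]
                denom = self.counts.get(ctx, 0) if ctx else self.total_tokens
                return weight * count / denom
            ngram = ngram[1:]        # back off: drop the oldest context word
            weight *= self.alpha
        return 0.0                   # unseen even as a unigram


if __name__ == "__main__":
    model = HashedNgramModel(max_order=2)
    model.train("the cat sat on the mat".split())
    model.prune()
    print(model.score(["the"], "cat"))  # seen bigram: count("the cat")/count("the") = 1/2
    print(model.score(["sat"], "the"))  # unseen bigram: 0.4 * count("the")/N = 0.4 * 2/6
```

For the approximate-model item, one well-known route is to store n-gram membership (or quantized counts) in a Bloom filter, accepting a small false-positive rate in exchange for a large reduction in memory; the hashing above already makes the same trade, since a rare hash collision is the price of never keeping the strings.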
Tags
Data Science
Related
N-Gram Representation
Bigram Model
N-Gram Model
Sentence Generation from Unigram Model
Unknown Words and Problem of Sparsity
Historical Significance and Applications of N-gram Models
A statistical language model predicts the next word in a sentence based on the probability of its occurring after the preceding sequence of words. This model is trained exclusively on a massive corpus of texts written in the 19th century. When it is prompted with the partial sentence 'To save the file, the user clicked the...', what is the most probable explanation for its behavior?
Curse of Dimensionality in Traditional Language Models
Analyzing Zero Probability in an N-gram Model
Evaluating N-gram Model Complexity