Concept

Lemmatization

A lemma is a word that represents a whole group of words, and that group of words is called a lexeme. Lemmatization is the task of determining that two words have the same root, despite their surface differences. For instance, the word 'friends' will become 'friend' and 'worst' will become 'bad'. To start lemmatizing, use:

from nltk.stem import WordNetLemmatizer

To lemmatize a sentence, use word_tokenize first then iterate through the list with a loop.

0

1

Updated 2021-09-12

References


Tags

Python Programming Language

Text Normalization

Data Science

Learn After