Definition

Tokens and Words in NLP

In Natural Language Processing (NLP), text is first broken down into basic units called tokens, a process known as tokenization. Although the terms 'token' and 'word' are often used interchangeably, they are not the same thing. A token is a segment of text: it may be a whole word, but depending on the tokenization method it may also be a punctuation mark or a part of a word.
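To make the distinction concrete, the sketch below tokenizes one sentence in three ways. It is an illustration only: the sample sentence, the `TOY_VOCAB` set, and the `toy_subword_split` helper are hypothetical names invented for this example, and the greedy longest-match rule is a simplified stand-in for real subword methods, not the algorithm of any particular tokenizer.

```python
import re

text = "Tokenization isn't trivial!"

# 1. Whitespace splitting: tokens roughly match "words",
#    but punctuation stays attached to them.
whitespace_tokens = text.split()

# 2. Regex splitting: each run of word characters is one token,
#    and each punctuation mark becomes its own token.
punct_tokens = re.findall(r"\w+|[^\w\s]", text)

# 3. Toy subword splitting (assumption: a tiny hand-made vocabulary).
TOY_VOCAB = {"token", "ization", "is", "n", "t", "trivial"}

def toy_subword_split(word, vocab):
    """Greedily take the longest vocabulary entry at each position.

    Falls back to a single character when nothing in the vocabulary matches.
    """
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        # Shrink the candidate until it is in the vocabulary
        # or only one character is left.
        while end > start + 1 and word[start:end] not in vocab:
            end -= 1
        pieces.append(word[start:end])
        start = end
    return pieces

subword_tokens = [
    piece
    for tok in punct_tokens
    for piece in toy_subword_split(tok.lower(), TOY_VOCAB)
]

print(whitespace_tokens)  # ['Tokenization', "isn't", 'trivial!']
print(punct_tokens)       # ['Tokenization', 'isn', "'", 't', 'trivial', '!']
print(subword_tokens)     # ['token', 'ization', 'is', 'n', "'", 't', 'trivial', '!']
```

The same three-word sentence yields three, six, or eight tokens depending on the method, which is why token counts and word counts should not be assumed to match.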


Updated 2025-10-08


Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
