Example

Example Comparison of Token Masking and Token Deletion

The distinction between token masking and token deletion can be demonstrated using an original sequence, denoted as x\mathbf{x}: The puppies are frolicking outside the house .. When applying Token Masking to create a noisy sequence xnoise\mathbf{x}_{\mathrm{noise}}, selected tokens such as 'frolicking' and 'the' are replaced with a special symbol, resulting in: The puppies are [MASK] outside [MASK] house .. Conversely, applying Token Deletion to form xnoise\mathbf{x}_{\mathrm{noise}} completely removes the selected tokens from the sequence (e.g., frolicking and the), yielding: The puppies are outside house ..

0

1

Updated 2026-04-16

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related