Concept

Token Masking as an Input Corruption Method

Token masking is a technique for corrupting input data where specific tokens within a sequence are chosen at random and replaced with a special [MASK] token. This method is identical to the masking strategy employed in Masked Language Modeling (MLM).

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences