Definition

Formal Definition of the Masking Process in MLM

In Masked Language Modeling (MLM), the masking process is formally defined for an input sequence $\mathbf{x} = x_0 \ldots x_m$. A set of positions, $\mathcal{A}(\mathbf{x}) = \{i_1, \ldots, i_u\}$, is selected to be masked. This procedure generates a modified token sequence, $\bar{\mathbf{x}}$, in which the token at each selected position in $\mathcal{A}(\mathbf{x})$ is replaced with a special $[\mathrm{MASK}]$ symbol.
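The definition can be illustrated with a minimal Python sketch. The helper names `select_positions` and `apply_mask` are hypothetical, and the independent per-token sampling with a default rate of 0.15 is an assumption made for illustration only; the definition itself does not specify how the position set is chosen.

```python
import random

MASK = "[MASK]"  # the special mask symbol from the definition


def select_positions(tokens, mask_prob=0.15, rng=None):
    """Pick the position set A(x).

    Assumption: each position is chosen independently with probability
    mask_prob. The definition does not prescribe a selection strategy.
    """
    if rng is None:
        rng = random.Random()
    return [i for i in range(len(tokens)) if rng.random() < mask_prob]


def apply_mask(tokens, positions):
    """Build x_bar: replace the token at every selected position with [MASK]."""
    selected = set(positions)
    return [MASK if i in selected else tok for i, tok in enumerate(tokens)]


# x = x_0 ... x_m, with a hand-picked position set A(x) = {2, 5}
x = ["[CLS]", "the", "cat", "sat", "on", "the", "mat"]
A = [2, 5]
x_bar = apply_mask(x, A)
print(x_bar)
# ['[CLS]', 'the', '[MASK]', 'sat', 'on', '[MASK]', 'mat']
```

Tokens outside the selected positions are left unchanged, so the masked sequence always has the same length as the input.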


Updated 2026-05-02


Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences