Concept

Unchanged Tokens in BERT's MLM Strategy

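In BERT's Masked Language Modeling (MLM) strategy, about 15% of the input tokens are chosen for prediction. Of those chosen tokens, 80% are replaced with the [MASK] token, 10% are replaced with a random token, and the remaining 10% are kept in their original, unchanged form within the input sequence. The model must still predict the original token at these unchanged positions, which reduces the mismatch between pre-training and fine-tuning, where [MASK] never appears.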
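
The selection and corruption step can be sketched in a few lines of Python. This is a minimal illustration, not BERT's actual preprocessing code: the function name `apply_mlm_corruption`, the `select_prob` parameter, and the toy vocabulary are assumptions made for this example, and it works on plain string tokens rather than WordPiece ids. It uses the commonly cited 15% selection rate with an 80/10/10 split.

```python
import random

MASK_TOKEN = "[MASK]"

def apply_mlm_corruption(tokens, vocab, select_prob=0.15, rng=None):
    """Hypothetical helper: returns (corrupted_tokens, labels).

    labels[i] holds the original token at every position chosen for
    prediction and None elsewhere.
    """
    rng = rng or random.Random()
    corrupted = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        # Skip positions that are not chosen for prediction.
        if rng.random() >= select_prob:
            continue
        labels[i] = tok  # the model must predict the original token here
        r = rng.random()
        if r < 0.8:
            corrupted[i] = MASK_TOKEN          # 80%: replace with [MASK]
        elif r < 0.9:
            corrupted[i] = rng.choice(vocab)   # 10%: replace with a random token
        # remaining 10%: keep the token unchanged in the input
    return corrupted, labels

if __name__ == "__main__":
    sentence = "the quick brown fox jumps over the lazy dog".split()
    toy_vocab = ["apple", "river", "green", "walk"]  # assumed toy vocabulary
    corrupted, labels = apply_mlm_corruption(
        sentence, toy_vocab, rng=random.Random(0)
    )
    print(corrupted)
    print(labels)
```

Note that an unchanged position still carries a label, so it contributes to the training loss even though the input at that position looks untouched.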


Updated 2026-04-17

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences