Example of an Unchanged Token in a BERT Input Sequence
To illustrate the strategy of leaving a selected token unchanged in BERT's Masked Language Modeling (MLM), consider the original input: [CLS] It is raining . [SEP] I need an umbrella . [SEP]. In MLM, 15% of tokens are selected for prediction; of these, 80% are replaced with [MASK], 10% are replaced with a random token, and 10% are left unchanged. If the token 'I' is selected for prediction but falls into the 10% that are left as-is, the input sequence fed to the model is identical to the original. Even though the token is neither masked nor altered, the model must still predict 'I' at that position from the surrounding context.
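The sketch below is a minimal illustration of this 80/10/10 decision, not BERT's actual implementation. The function name, the toy vocabulary, and the chosen random seed are all illustrative assumptions; only the 80/10/10 split itself comes from the BERT pre-training recipe.

    import random

    # Illustrative toy vocabulary for the 10% random-replacement branch.
    VOCAB = ["it", "is", "raining", "i", "need", "an", "umbrella", "."]

    def corrupt_selected_token(token: str, rng: random.Random) -> str:
        """Return the input-side token for one position selected for prediction."""
        r = rng.random()
        if r < 0.8:                 # 80%: replace with [MASK]
            return "[MASK]"
        elif r < 0.9:               # 10%: replace with a random vocabulary token
            return rng.choice(VOCAB)
        else:                       # 10%: leave the original token unchanged
            return token

    rng = random.Random(7)
    tokens = ["[CLS]", "It", "is", "raining", ".", "[SEP]",
              "I", "need", "an", "umbrella", ".", "[SEP]"]
    selected = 6                    # position of 'I', chosen for prediction

    tokens[selected] = corrupt_selected_token(tokens[selected], rng)
    print(" ".join(tokens))
    # Whichever branch fires, the training label at position 6 is still 'I':
    # the model is always asked to predict the original token there.

Note that the label is fixed before corruption, so in the unchanged-token case the model sees exactly the original sequence yet is still scored on predicting 'I', which is the point of this part of the strategy.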