Learn Before
Predicting from Corrupted Input
During a language model's pre-training, the input sentence 'The quick brown fox jumps over the lazy dog' is modified. The token 'jumps' is selected for prediction and, as part of the training strategy, is replaced by a random token, 'sings'. The model is then fed the corrupted input: 'The quick brown fox sings over the lazy dog'. What is the model's specific objective for the token at the position of 'sings'?
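The corruption described in this question can be sketched as a BERT-style masking routine. This is a minimal illustration, not any library's actual implementation; the function name is hypothetical, and the 80/10/10 split among `[MASK]`, random replacement, and keep-unchanged follows the recipe from the BERT paper:

```python
import random

def corrupt_for_mlm(tokens, vocab, mask_prob=0.15, seed=0):
    """BERT-style corruption sketch: of the tokens selected for prediction,
    ~80% become [MASK], ~10% become a random vocabulary token, and ~10%
    are left unchanged. The target is always the ORIGINAL token, so at a
    randomly replaced position (e.g. 'sings') the model must still
    predict the original word (e.g. 'jumps')."""
    rng = random.Random(seed)
    corrupted = list(tokens)
    targets = [None] * len(tokens)        # None = not predicted at this position
    for i, tok in enumerate(tokens):
        if rng.random() < mask_prob:      # select this token for prediction
            targets[i] = tok              # objective: recover the original token
            roll = rng.random()
            if roll < 0.8:
                corrupted[i] = "[MASK]"
            elif roll < 0.9:
                corrupted[i] = rng.choice(vocab)  # random-replacement branch
            # else: token is left unchanged but still predicted
    return corrupted, targets
```

Whatever branch is taken, the training signal at a selected position is the original token, which is why the model's objective at the position of 'sings' is to predict 'jumps'.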
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Example of Random Token Replacement in a BERT Input Sequence
In a language model's pre-training, a portion of the input tokens selected for prediction are substituted with a completely random token from the vocabulary, rather than always being replaced with a special placeholder like [MASK]. What is the primary analytical justification for this specific strategy?
Predicting from Corrupted Input
A language model's pre-training process involves selecting a subset of tokens in an input sequence for prediction. One modification technique applied to these selected tokens is to substitute them with a completely random token from the model's vocabulary. Given the original sequence:
The cat sat on the mat.
If the token 'sat' is chosen for this specific random replacement technique, which of the following is a valid resulting sequence?
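The random-replacement branch asked about above can be sketched in isolation. This is an illustrative helper (the function name and toy vocabulary are assumptions, not from any library); the key point it demonstrates is that the corrupted input changes while the prediction target stays the original token:

```python
import random

def random_replace(tokens, index, vocab, seed=0):
    """The 'random token' branch of BERT-style corruption: the selected
    token is swapped for a random, different vocabulary token, while the
    training target remains the ORIGINAL token at that position."""
    rng = random.Random(seed)
    candidates = [w for w in vocab if w != tokens[index]]
    corrupted = list(tokens)
    corrupted[index] = rng.choice(candidates)
    return corrupted, tokens[index]       # corrupted sequence, original target

tokens = ["The", "cat", "sat", "on", "the", "mat", "."]
vocab = ["dog", "ran", "sings", "blue"]   # toy vocabulary for illustration
corrupted, target = random_replace(tokens, 2, vocab, seed=42)
# 'sat' is replaced by some other vocabulary word; the target is still 'sat'
```

A valid resulting sequence therefore differs from the original only at the chosen position, e.g. "The cat ran on the mat." with 'sat' as the token to be predicted.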