A language model is being trained using a masked language modeling objective. The original input sentence is 'A quick brown fox jumps over the lazy dog'. During a training step, the tokens 'quick' (at position 2) and 'lazy' (at position 8) are masked. The model receives the corrupted input: '[CLS] A [MASK] brown fox jumps over the [MASK] dog'. Which of the following mathematical expressions correctly represents the training objective for this specific step, which the model aims to maximize?
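For reference, one consistent way to write the objective for this step (a sketch; the symbols $\tilde{x}$ for the corrupted input and $\theta$ for the model parameters are assumed here, not given in the card):

```latex
\max_{\theta} \; \log P\!\left(x_2 = \text{quick} \mid \tilde{x}; \theta\right)
             + \log P\!\left(x_8 = \text{lazy} \mid \tilde{x}; \theta\right)
```

Only the two masked positions contribute to the loss; the unmasked tokens are conditioned on but not predicted.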
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is being trained on a sentence where two words have been replaced with a special [MASK] token. The training objective is to maximize the sum of the log-probabilities of the original words at these two masked positions. Why is the objective formulated as a sum of log-probabilities rather than, for example, a product of the probabilities?
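A minimal sketch of the numerical argument behind this question: because $\log$ is monotonic, maximizing the sum of log-probabilities is equivalent to maximizing the product of probabilities, but the product underflows to zero in floating point while the log-sum stays finite. The probability values below are hypothetical, chosen only to illustrate the effect.

```python
import math

# Hypothetical per-position probabilities of the correct tokens
# (small, as is typical over a large vocabulary).
probs = [1e-8] * 50

# Direct product of probabilities: underflows in float64.
product = 1.0
for p in probs:
    product *= p
print(product)  # exactly 0.0 after underflow

# Sum of log-probabilities: finite and numerically stable,
# and its maximizer is the same as the product's.
log_sum = sum(math.log(p) for p in probs)
print(log_sum)  # approximately -921.0
```

Gradients of a sum also decompose per position, which is why training frameworks optimize the summed (or averaged) log-likelihood rather than the raw product.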
Evaluating Model Performance in MLM Training