During a language model's training, a specific token is chosen from an input sequence to be predicted. In a small percentage of cases, the training strategy requires this chosen token to be left as-is, without being replaced. Consider the original sequence: [CLS] The quick brown fox jumps . [SEP]. If the token 'fox' is selected for prediction but falls under the rule where it remains unchanged, what is the final input sequence fed to the model for this training step?
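The rule described here is the corruption step used in BERT-style masked language modeling, where a selected token is usually replaced but is sometimes left untouched. The sketch below is illustrative only: the 80/10/10 split (mask / random token / keep unchanged) follows the standard BERT recipe and is an assumption, since the question only says "a small percentage"; the vocabulary, the function name `corrupt_selected_token`, and the `KeepBranch` helper are likewise hypothetical. It shows that when the "keep unchanged" branch fires, the sequence fed to the model is identical to the original, i.e. [CLS] The quick brown fox jumps . [SEP], while the model is still trained to predict 'fox' at that position.

```python
import random

# Minimal sketch of a BERT-style masked-token corruption step.
# The 80/10/10 split is assumed from the standard BERT recipe; the
# question itself only says "a small percentage of cases".
VOCAB = ["the", "quick", "brown", "fox", "jumps", "dog", "runs", "."]

def corrupt_selected_token(tokens, index, rng=random):
    """Return a copy of `tokens` with the token at `index` corrupted
    according to the masked-language-model training rule."""
    out = list(tokens)
    r = rng.random()
    if r < 0.8:
        out[index] = "[MASK]"           # 80%: replace with the mask token
    elif r < 0.9:
        out[index] = rng.choice(VOCAB)  # 10%: replace with a random token
    # remaining 10%: leave the token unchanged -- the case in the question
    return out

tokens = ["[CLS]", "The", "quick", "brown", "fox", "jumps", ".", "[SEP]"]

# Hypothetical helper that forces the "leave unchanged" branch so we can
# inspect the sequence the model would receive in that case.
class KeepBranch:
    def random(self):
        return 0.95
    def choice(self, xs):
        return xs[0]

print(corrupt_selected_token(tokens, tokens.index("fox"), rng=KeepBranch()))
# -> ['[CLS]', 'The', 'quick', 'brown', 'fox', 'jumps', '.', '[SEP]']
# The input is unchanged; the training target at that position is still 'fox'.
```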