Learn Before
In a specific pre-training setup for a language model, a single input (composed of one or two sentences) is used to perform two distinct tasks simultaneously: one task involves predicting words that have been intentionally hidden in the text, and the other involves determining the relationship between the two sentences (e.g., whether one follows the other). Which statement accurately describes how the performance on these two tasks is used to update the model?
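For reference, in BERT-style pre-training the two task losses are typically summed into a single joint loss, followed by one backward pass and one parameter update per step. A minimal pure-Python sketch of that idea, with toy quadratic stand-ins for the masked-word and sentence-relationship losses (these placeholders are illustrative assumptions, not the real objectives):

```python
# Toy sketch of a joint-loss training step in the BERT style: both task
# losses are computed from the same input, summed into one objective,
# and followed by ONE parameter update (not two separate updates).
# The quadratic "losses" below are placeholders, not real MLM/NSP losses.

def loss_mlm(w):
    return (w - 2.0) ** 2      # stand-in for the masked-word (MLM) loss

def loss_nsp(w):
    return (w + 1.0) ** 2      # stand-in for the sentence-relationship (NSP) loss

def joint_training_step(w, lr=0.1):
    """One training iteration: sum the losses, take one gradient step."""
    joint = lambda x: loss_mlm(x) + loss_nsp(x)   # single combined objective
    eps = 1e-6
    # finite-difference gradient of the JOINT loss, not of each loss separately
    grad = (joint(w + eps) - joint(w - eps)) / (2 * eps)
    return w - lr * grad                           # one update per input

w = joint_training_step(0.0)
print(round(w, 4))  # 0.2 -- a single step toward the joint minimum at w = 0.5
```

The key point the sketch illustrates: the gradient is taken of the summed loss, so one update reflects performance on both tasks at once.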
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Consider a language model pre-training process that uses a single input sequence (e.g., a pair of sentences) to perform two tasks: predicting masked words and determining whether the second sentence logically follows the first. In this process, the model first calculates the loss for the masked word task and updates its internal parameters. Then, using the same input, it calculates the loss for the sentence relationship task and performs a second, separate update to its parameters.
A language model is being pre-trained using a dual-task objective on a single input sequence composed of two sentences. One task is to predict masked words within the sentences, and the other is to predict whether the second sentence is the actual next sentence. Arrange the following steps in the correct computational order for a single training iteration.