1Cademy - A model is trained using a two-stage process. In the first stage, given an input context `c`, the model identifies an optimal output sequence, `ŷ`. In the second stage, the model's parameters are updated to maximize the probability of generating that same sequence `ŷ`, but this time conditioned on a slightly modified version of the original context, `c'`. What is the primary reason for using the modified context `c'` in the second stage instead of the original context `c`?

Learn Before

Log-Probability Loss with Model-Generated Target

Multiple Choice

A model is trained using a two-stage process. In the first stage, given an input context c, the model identifies an optimal output sequence, ŷ. In the second stage, the model's parameters are updated to maximize the probability of generating that same sequence ŷ, but this time conditioned on a slightly modified version of the original context, c'. What is the primary reason for using the modified context c' in the second stage instead of the original context c?
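
To make the two stages concrete, here is a minimal sketch of the procedure described in the question. It is an illustration under stated assumptions, not the method of any particular paper or library: the "model" is a hypothetical table of logits over three candidate outputs, the names `stage_one`, `stage_two_step`, and `c_prime` are invented for this example, and each context has its own row of logits (a real model would share parameters across contexts).

```python
import math

def softmax(logits):
    """Convert a list of logits into probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical toy model: one row of logits per context, over 3 candidate outputs.
logits = {
    "c": [2.0, 0.5, 0.1],        # original context c
    "c_prime": [0.2, 0.3, 0.1],  # modified context c'
}

def stage_one(context):
    """Stage 1: pick the highest-probability output y_hat under the given context."""
    probs = softmax(logits[context])
    return max(range(len(probs)), key=probs.__getitem__)

def loss(y_hat, context):
    """Log-probability loss: -log Pr(y_hat | context)."""
    return -math.log(softmax(logits[context])[y_hat])

def stage_two_step(y_hat, context, lr=0.5):
    """Stage 2: one gradient step that raises Pr(y_hat | context)."""
    probs = softmax(logits[context])
    for k in range(len(probs)):
        # Gradient of -log p[y_hat] with respect to logit k is p[k] - 1[k == y_hat].
        grad = probs[k] - (1.0 if k == y_hat else 0.0)
        logits[context][k] -= lr * grad

y_hat = stage_one("c")                # target found under the original context c
before = loss(y_hat, "c_prime")       # loss of that target under c'
stage_two_step(y_hat, "c_prime")      # update conditioned on c', not on c
after = loss(y_hat, "c_prime")
print(f"y_hat={y_hat}, loss under c' before={before:.3f}, after={after:.3f}")
```

Note that the target `y_hat` is chosen using `c`, while the update is computed using `c'`; the question asks why the procedure is set up this way.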

Updated 2025-10-05
