A language model is being trained with the objective of modeling the joint probability of an input sequence x and an output sequence y, which are treated as a single, concatenated sequence. During a single training step for this combined sequence, how is the model's performance error (loss) calculated?
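The distinction can be made concrete with a small sketch. Under the joint objective over the concatenated sequence [x, y], every token contributes to the loss; under a conditional objective p(y | x), the input tokens are masked out. The probabilities and mask below are hypothetical values for illustration, not outputs of any real model.

```python
import math

# Hypothetical per-token probabilities the model assigns to the correct
# next token at each position of the concatenated sequence [x, y].
# The first three positions belong to the input x, the last two to the output y.
probs = [0.9, 0.8, 0.7, 0.6, 0.5]
is_output = [False, False, False, True, True]  # True marks tokens of y

# Joint objective p(x, y): loss is the summed negative log-likelihood
# over ALL tokens of the combined sequence, x and y alike.
joint_loss = -sum(math.log(p) for p in probs)

# Conditional-style variant: input tokens are masked, so only the
# tokens of y contribute to the loss.
conditional_loss = -sum(math.log(p) for p, m in zip(probs, is_output) if m)

print(f"joint loss:       {joint_loss:.4f}")
print(f"conditional loss: {conditional_loss:.4f}")
```

Because the joint objective accumulates error over the input tokens as well, it is always at least as large as the masked variant on the same sequence.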
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Conditional vs. Joint Probability Objectives in Language Modeling
A language model is being trained with the objective of modeling the joint probability of an input sequence x and an output sequence y, which are treated as a single, concatenated sequence. During a single training step for this combined sequence, how is the model's performance error (loss) calculated?
Evaluating a Training Objective for a Base Model
A language model is being trained with the objective of modeling the joint probability of a combined sequence
[x, y]. For this objective, the model's parameters are updated based only on its ability to correctly predict the tokens in the output sequence y.