A developer is fine-tuning a language model on a dataset of [instruction, response] pairs. Initially, the training process calculated the prediction loss across all tokens in both the instruction and the response. The developer then modifies the process to calculate loss only on the tokens in the response. What is the primary effect of this change on the model's training objective?
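A minimal PyTorch sketch of the change described above, contrasting loss over all tokens with loss over response tokens only. The token ids, the 3-token instruction length, and the use of `ignore_index=-100` (the common supervised fine-tuning convention, and PyTorch's default ignore index) are illustrative assumptions, not details from the question:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Toy sequence: 3 instruction tokens followed by 2 response tokens
# (token ids and logits are made up for illustration).
vocab_size = 10
token_ids = torch.tensor([4, 7, 2, 5, 1])         # full [instruction, response] sequence
logits = torch.randn(len(token_ids), vocab_size)  # model outputs, one row per token

# Before: prediction loss averaged over every token in the sequence.
loss_all = F.cross_entropy(logits, token_ids)

# After: instruction tokens are masked out with ignore_index=-100, so only
# the response tokens contribute to the gradient.
labels = token_ids.clone()
labels[:3] = -100  # assume the first 3 tokens are the instruction
loss_response_only = F.cross_entropy(logits, labels, ignore_index=-100)

# Averaging cross-entropy over just the response slice gives the same value,
# since the 'mean' reduction divides by the number of non-ignored targets.
manual = F.cross_entropy(logits[3:], token_ids[3:])
print(torch.isclose(loss_response_only, manual))  # tensor(True)
```

With the mask in place, the objective becomes maximizing the conditional log-probability of the response given the instruction, rather than also modeling the instruction text itself.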
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.5 Inference - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Relationship Between Joint, Conditional, and Marginal Log-Probabilities of Sequences
Analysis of Language Model Training Objectives
Selecting an Appropriate Language Model Training Objective