1Cademy - A developer is fine-tuning a language model on a dataset where each entry consists of a context and a desired completion. For training, the context and completion are concatenated into a single input sequence. The training objective is configured so that the loss is calculated *only* on the models predictions for the completion part of the sequence. Given this setup, which statement accurately describes how the models parameters are updated during the backward pass for a single training step?

Learn Before

Selective Gradient Propagation for Sub-sequence Loss

Multiple Choice

A developer is fine-tuning a language model on a dataset where each entry consists of a context and a desired completion. For training, the context and completion are concatenated into a single input sequence. The training objective is configured so that the loss is calculated only on the model's predictions for the completion part of the sequence. Given this setup, which statement accurately describes how the model's parameters are updated during the backward pass for a single training step?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related