1Cademy - Training a Reward Model on Segment-Level Scores via Regression Loss

Learn Before

Segment Score as Difference of Sequence Scores

Activity (Process)

Training a Reward Model on Segment-Level Scores via Regression Loss

Once scores for individual segments are computed, these segment-level scores can serve as the target values for training a reward model. The training is structured as a regression task, where the model's parameters are optimized by minimizing a regression loss function. This loss function quantifies the difference between the model's predicted scores and the calculated segment scores.

Updated 2026-05-03

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Segment-Based Rating Loss Function
A team is training a model to predict a quality score for individual segments of a generated text. The training process is designed as a regression task, aiming to minimize the difference between the model's predicted scores and pre-calculated target scores for each segment. After one training step, the model's performance on three specific segments is as follows:
- Segment 1: Target Score = 0.9, Predicted Score = 0.8
- Segment 2: Target Score = 0.1, Predicted Score = 0.5
- Segment 3: Target Sc
Analyzing Reward Model Parameter Updates
When training a reward model on segment-level scores using a regression loss, the primary objective is to ensure the model's predicted scores for different segments maintain the same relative order (ranking) as the target scores, even if the absolute values of the predictions are consistently different from the targets.

Learn Before

Related

Learn After