1Cademy - Segment-Based Rating Loss Function

Learn Before

Training a Reward Model on Segment-Level Scores via Regression Loss

Formula

Segment-Based Rating Loss Function

When segment-level rating scores are available, a reward model can be trained with a pointwise method using a regression loss. The loss is defined as the negative expected squared difference between the target rating score for a segment and the reward model's predicted score for that segment:

$\mathcal{L}_{\mathrm{rating}} = -\mathbb{E}_{\bar{\mathbf{y}}_k} \Big[ \big( s(\bar{\mathbf{y}}_k) - r(\mathbf{x}, \mathbf{y}, \bar{\mathbf{y}}_k) \big)^2 \Big]$

In this equation, $s(\bar{\mathbf{y}}_k)$ is the target rating score for segment $\bar{\mathbf{y}}_k$, and $r(\mathbf{x}, \mathbf{y}, \bar{\mathbf{y}}_k)$ is the reward predicted by the model for that segment given the prompt $\mathbf{x}$ and the full output $\mathbf{y}$. The expectation is taken over the segments $\bar{\mathbf{y}}_k$. Because of the leading minus sign, the expression as written reaches its maximum (zero) when every prediction matches its target; standard regression training minimizes the same expected squared difference without the minus sign.
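As a rough illustration of the formula, the following minimal sketch estimates the expectation by averaging over the segments of a single output. The function name `segment_rating_loss` and the example scores are hypothetical and do not come from the source. The sketch returns the positive mean squared difference (the quantity to minimize); negating it gives the expression with the leading minus sign shown in the formula.

```python
from typing import Sequence


def segment_rating_loss(targets: Sequence[float],
                        predictions: Sequence[float]) -> float:
    """Mean squared difference between target and predicted segment scores.

    Approximates E[(s - r)^2] by averaging over the segments provided.
    Returns the positive value, i.e. the quantity to minimize.
    """
    if len(targets) != len(predictions):
        raise ValueError("targets and predictions must have the same length")
    if len(targets) == 0:
        raise ValueError("at least one segment is required")
    squared_errors = [(s - r) ** 2 for s, r in zip(targets, predictions)]
    return sum(squared_errors) / len(squared_errors)


# Hypothetical data: three segments of one output y for a prompt x.
target_scores = [1.0, 0.0, 0.5]      # s(y_k), e.g. human-assigned ratings
predicted_rewards = [0.5, 0.0, 1.0]  # r(x, y, y_k), from some reward model

loss = segment_rating_loss(target_scores, predicted_rewards)
print(loss)
```

In an actual training setup, the predictions would come from a differentiable reward model and the loss would be minimized with a gradient-based optimizer; this sketch only shows how the scalar is computed.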


Updated 2026-05-03
