Reward Model Objective Calculation
You are training a reward model where the goal is to align the model's predicted scores, $\hat{s}_i$, with the human-provided scores, $s_i$. The training process aims to maximize the objective function $J = -\sum_{i=1}^{N} (s_i - \hat{s}_i)^2$. Based on the case study below, calculate the initial objective value and analyze the effect of a model update.
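A minimal sketch of the objective calculation. The scores below are hypothetical placeholders, since the case-study values are not included in this card; only the formula $J = -\sum_i (s_i - \hat{s}_i)^2$ comes from the text.

```python
# Hypothetical case-study data (assumed for illustration, not from the card).
human_scores = [4.0, 2.0, 5.0]   # s_i: human-provided ratings
predicted = [3.5, 2.5, 4.0]      # s_hat_i: reward-model outputs

# Objective J = -sum_i (s_i - s_hat_i)^2.
# Training maximizes J, which is equivalent to minimizing the
# squared error between predicted and human scores.
J = -sum((s - p) ** 2 for s, p in zip(human_scores, predicted))
print(J)  # -1.5
```

A model update that moves each prediction closer to its human score shrinks every squared term, so J increases toward its maximum of 0.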
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A machine learning engineer is training a reward model where the goal is to align the model's predicted scores, $\hat{s}_i$, with human-provided scores, $s_i$. The standard approach is to maximize the objective function $J = -\sum_{i=1}^{N} (s_i - \hat{s}_i)^2$. Suppose the engineer makes a mistake and instead configures the training process to maximize the squared error, effectively removing the negative sign from the objective: $J' = \sum_{i=1}^{N} (s_i - \hat{s}_i)^2$. What would be the most likely effect on the model's behavior during training?
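The sign-flip bug can be demonstrated with a one-parameter sketch (hypothetical values, assuming plain gradient ascent on each objective): ascending $-(s - p)^2$ pulls the prediction toward the human score, while ascending $(s - p)^2$ pushes it away without bound.

```python
s = 3.0    # human-provided score (assumed value)
lr = 0.1   # learning rate

p_good = 0.0  # prediction trained on the correct objective, max -(s - p)^2
p_bad = 0.0   # prediction trained on the buggy objective,   max  (s - p)^2

for _ in range(50):
    # d/dp [-(s - p)^2] = 2*(s - p): ascent moves p toward s
    p_good += lr * 2 * (s - p_good)
    # d/dp [(s - p)^2] = -2*(s - p): ascent moves p away from s
    p_bad += lr * (-2) * (s - p_bad)

print(round(p_good, 3))  # converges to ~3.0, the human score
print(round(p_bad, 3))   # diverges: the error grows each step
```

So the most likely effect of the bug is divergence: the model is rewarded for making its predictions as far from the human scores as possible, and the loss grows without bound instead of shrinking.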
Reward Model Objective Calculation
Pointwise Rating Loss (L_rating) Formula
In the context of training a model to predict scores for a given input-output pair, consider the following objective function: $L_{\text{rating}} = -\sum_{i=1}^{N} (s_i - \hat{s}_i)^2$. Match each component of the formula to its correct description.