1Cademy - A machine learning engineer is training a reward model where human annotators assign an absolute quality score to each generated text. The engineer considers switching the loss function from Mean Squared Error (MSE), which calculates `(human_score - predicted_reward)^2`, to Mean Absolute Error (MAE), which calculates `|human_score - predicted_reward|`. What is the most significant consequence of this change on the reward model's learned behavior?

Learn Before

Pointwise Loss Function for Reward Model Training

Multiple Choice

A machine learning engineer is training a reward model where human annotators assign an absolute quality score to each generated text. The engineer considers switching the loss function from Mean Squared Error (MSE), which calculates `(human_score - predicted_reward)^2`, to Mean Absolute Error (MAE), which calculates `|human_score - predicted_reward|`. What is the most significant consequence of this change on the reward model's learned behavior?
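To reason about the question, it can help to put the two losses side by side. The following is a minimal sketch, not any particular training setup: the function names and the `scores` list are hypothetical, chosen only for illustration. It shows that the MSE gradient grows with the size of the error while the MAE (sub)gradient has constant magnitude, and that the best constant prediction is the mean under MSE and the median under MAE.

```python
import statistics


def mse_loss(human_score, predicted_reward):
    # Squared error for one example, as defined in the question.
    return (human_score - predicted_reward) ** 2


def mae_loss(human_score, predicted_reward):
    # Absolute error for one example, as defined in the question.
    return abs(human_score - predicted_reward)


def mse_grad(human_score, predicted_reward):
    # Derivative of (h - p)^2 with respect to p: scales with the error.
    return 2.0 * (predicted_reward - human_score)


def mae_grad(human_score, predicted_reward):
    # Subgradient of |h - p| with respect to p: only the sign of the error
    # matters (0.0 is used at the kink where h == p).
    diff = predicted_reward - human_score
    return float((diff > 0) - (diff < 0))


# Hypothetical annotator scores for one text; the last one is an outlier.
scores = [7.0, 7.0, 8.0, 8.0, 1.0]

# The constant prediction that minimizes total MSE is the mean;
# the one that minimizes total MAE is the median.
best_under_mse = statistics.mean(scores)
best_under_mae = statistics.median(scores)

if __name__ == "__main__":
    print("MSE-optimal constant:", best_under_mse)
    print("MAE-optimal constant:", best_under_mae)
    print("Gradient on the outlier at p=7.0 (MSE):", mse_grad(1.0, 7.0))
    print("Gradient on the outlier at p=7.0 (MAE):", mae_grad(1.0, 7.0))
```

In this toy data the single low score pulls the MSE-optimal prediction well below the value most annotators gave, while the MAE-optimal prediction stays with the majority. Under these assumptions, switching to MAE makes the reward model less sensitive to large, outlying annotation errors.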

Updated 2025-10-04
