Multiple Choice

A machine learning engineer is training a reward model where human annotators assign an absolute quality score to each generated text. The engineer considers switching the loss function from Mean Squared Error (MSE), which calculates (human_score - predicted_reward)^2, to Mean Absolute Error (MAE), which calculates |human_score - predicted_reward|. What is the most significant consequence of this change on the reward model's learned behavior?
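The two losses named in the question can be compared with a minimal sketch. The function names and the sample annotator scores below are hypothetical, chosen only for illustration, and the sketch assumes a single scalar prediction rather than a real reward model. It shows how each loss and its gradient with respect to the predicted reward scale with the size of the error, and which constant prediction each loss favors on a small set of scores containing one extreme value.

```python
import statistics


def mse_loss(human_score, predicted_reward):
    # Squared error, as written in the question: (human_score - predicted_reward)^2
    return (human_score - predicted_reward) ** 2


def mae_loss(human_score, predicted_reward):
    # Absolute error, as written in the question: |human_score - predicted_reward|
    return abs(human_score - predicted_reward)


def mse_grad(human_score, predicted_reward):
    # Derivative of the squared error w.r.t. the prediction: grows linearly with the error
    return 2.0 * (predicted_reward - human_score)


def mae_grad(human_score, predicted_reward):
    # Subgradient of the absolute error w.r.t. the prediction: only the sign of the error
    # (0 is used at zero error, which is one valid subgradient choice)
    diff = predicted_reward - human_score
    return (diff > 0) - (diff < 0)


def total_loss(loss_fn, scores, constant_prediction):
    # Sum of per-example losses when the model predicts the same value everywhere
    return sum(loss_fn(s, constant_prediction) for s in scores)


if __name__ == "__main__":
    # Hypothetical annotator scores with one extreme value (illustrative only)
    scores = [1.0, 2.0, 2.0, 3.0, 10.0]
    mean_pred = statistics.mean(scores)
    median_pred = statistics.median(scores)

    for name, loss_fn in [("MSE", mse_loss), ("MAE", mae_loss)]:
        print(name,
              "total at mean:", total_loss(loss_fn, scores, mean_pred),
              "total at median:", total_loss(loss_fn, scores, median_pred))

    # Gradient magnitude for a small and a large error
    for pred in (4.5, 1.0):
        print("pred", pred,
              "MSE grad:", mse_grad(5.0, pred),
              "MAE grad:", mae_grad(5.0, pred))
```

In this sketch the MSE gradient grows with the error while the MAE gradient keeps a constant magnitude, so a single extreme score moves an MSE-trained prediction more than an MAE-trained one.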

Updated 2025-10-04

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science