Learn Before
Handling Labeler Disagreement in Reward Modeling
Based on the empirical formulation of the pair-wise ranking loss, which incorporates preference probabilities, explain how the 70/30 split in labeler preference for this specific data point influences the loss calculation and the subsequent update to the model's parameters. Contrast this with a scenario where all 10 labelers agreed that y_A was the preferred response.
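The contrast above can be made concrete with a small sketch. Assuming the standard preference-probability-weighted pairwise ranking loss, L = -[p·log σ(Δ) + (1-p)·log σ(-Δ)], where Δ = R(x, y_A) - R(x, y_B) and p is the fraction of labelers preferring y_A (function names here are illustrative, not from the source):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def weighted_pairwise_loss(delta, p):
    """Preference-weighted pairwise ranking loss.

    delta: R(x, y_A) - R(x, y_B), the reward margin.
    p: empirical preference probability for y_A (e.g. 0.7 for a 70/30 split).
    """
    return -(p * math.log(sigmoid(delta)) + (1 - p) * math.log(sigmoid(-delta)))

def grad_wrt_delta(delta, p):
    """Derivative of the loss w.r.t. delta; algebraically it simplifies
    to sigmoid(delta) - p, so the gradient vanishes when sigmoid(delta) = p."""
    return sigmoid(delta) - p

# 70/30 split: the gradient is zero at a *finite* margin,
# delta* = log(p / (1 - p)) = log(0.7 / 0.3) ~= 0.85,
# so the update nudges the scores toward a moderate gap and then stops.
print(grad_wrt_delta(0.0, 0.7))                    # -0.2: modest pull on the margin
print(grad_wrt_delta(math.log(0.7 / 0.3), 0.7))    # ~0: equilibrium reached

# Unanimous 10/10 agreement (p = 1.0): the gradient sigmoid(delta) - 1
# is negative for every finite delta, so optimization keeps widening
# the gap between R(x, y_A) and R(x, y_B).
print(grad_wrt_delta(0.0, 1.0))                    # -0.5: stronger, persistent pull
```

The key difference: with p = 0.7 the loss has an interior minimum, so the parameter update is both smaller in magnitude and self-limiting, whereas p = 1.0 produces a larger gradient that never fully decays, pushing the margin to grow without bound (in practice checked only by regularization and finite training).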
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A reward model is being trained using a pair-wise ranking loss function. For a given prompt x, the preference dataset contains a pair of responses: a preferred response y_pref and a rejected response y_rej. Initially, the model assigns the following scores: R(x, y_pref) = 2.0 and R(x, y_rej) = 3.0. Based on the objective of the loss function, what is the most likely change to these scores after a single optimization step on this data point?
Analysis of a Weighted Ranking Loss
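The expected answer can be checked numerically. Under the usual unweighted pairwise loss -log σ(R(x, y_pref) - R(x, y_rej)), a gradient step raises the preferred score and lowers the rejected one. A minimal sketch, assuming plain SGD with an illustrative learning rate (the function name and lr value are assumptions, not from the source):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def sgd_step(r_pref, r_rej, lr=0.1):
    """One SGD step on loss = -log sigmoid(r_pref - r_rej).

    The gradient w.r.t. r_pref is -sigmoid(-(r_pref - r_rej)) and the
    gradient w.r.t. r_rej is its negation, so both scores move by the
    same magnitude in opposite directions.
    """
    g = sigmoid(-(r_pref - r_rej))  # shared gradient magnitude
    return r_pref + lr * g, r_rej - lr * g

# Starting from the card's scores: the ranking is currently wrong
# (2.0 < 3.0), so sigmoid(-delta) = sigmoid(1.0) ~= 0.73 is large and
# the correction is correspondingly strong.
new_pref, new_rej = sgd_step(2.0, 3.0)
print(new_pref, new_rej)  # r_pref rises above 2.0, r_rej falls below 3.0
```

After the step, R(x, y_pref) increases and R(x, y_rej) decreases, shrinking the incorrect 1.0 gap; repeated steps would eventually flip the ordering so the preferred response scores higher.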