Modeling Preference Probability with the Bradley-Terry Model in RLHF

In the context of RLHF, the Bradley-Terry model is adapted to formally express the probability that a given model output, $y_a$, is preferred over another, $y_b$. This application of the model provides a mathematical framework for quantifying human preferences.
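
As a rough illustration of that probability, the standard Bradley-Terry form assigns each output a scalar score and compares the two scores. In the sketch below, the scores `reward_a` and `reward_b` stand in for a reward model's outputs for $y_a$ and $y_b$; the reward model, the function name, and the parameter names are assumptions made for this example and do not appear in the original text. Under those assumptions, the preference probability is exp(r_a) / (exp(r_a) + exp(r_b)), which is the same as a sigmoid applied to the difference r_a - r_b.

```python
import math


def preference_probability(reward_a: float, reward_b: float) -> float:
    """Bradley-Terry probability that output a is preferred over output b.

    reward_a and reward_b are hypothetical scalar scores, e.g. from a
    reward model (an assumption for this sketch).
    """
    # exp(r_a) / (exp(r_a) + exp(r_b)) equals sigmoid(r_a - r_b).
    diff = reward_a - reward_b
    # Branch on the sign so math.exp never receives a large positive value.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)
```

Equal scores give a probability of 0.5, and swapping the two arguments gives the complementary probability, so the two outcomes always sum to 1. For instance, `preference_probability(2.0, 0.0)` is about 0.88.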

Updated 2025-10-06

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences