1Cademy - Bradley-Terry Model

Learn Before

Pairwise Comparison for Human Feedback in RLHF

Theory

Bradley-Terry Model

The Bradley-Terry model, introduced by Bradley and Terry in 1952, is a simple and widely used probabilistic model for describing pairwise comparisons. It is designed to estimate the probability that one item will be preferred over another in a paired choice scenario.