1Cademy - Decomposing a Ranked List into Pairwise Preferences

Learn Before

Listwise Loss from Accumulated Pairwise Comparisons

Short Answer

Decomposing a Ranked List into Pairwise Preferences

A common method for training a preference model is to take a ranked list of responses and break it down into all its constituent pairwise preferences. For example, the ranking A > B > C is decomposed into three preferences: A is preferred over B, A is preferred over C, and B is preferred over C. Following this method, if a human annotator provides a ranked list of 5 distinct responses, how many individual pairwise preferences will be extracted from this single list for the loss calculation?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related