1Cademy - Conditional Probability of Pairwise Preference

Learn Before

Data-Generating Process and Data-Generating Distribution (in Machine Learning)

Formula

Conditional Probability of Pairwise Preference

The formula $\text{Pr}(\mathbf{y}_{k_1} \succ \mathbf{y}_{k_2} | \mathbf{x})$ represents the conditional probability that an outcome $\mathbf{y}_{k_1}$ is preferred to, or ranked higher than, another outcome $\mathbf{y}_{k_2}$ , given a specific input context $\mathbf{x}$ . The symbol $\succ$ denotes preference or a 'greater than' relationship in this context. This type of expression is fundamental in preference learning and ranking models, where the goal is to learn a function that can predict the relative order of items based on input features.

Updated 2026-06-21

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Bradley-Terry Model for Pairwise Preference Probability
Ranking Chatbot Responses
A user provides the prompt, denoted as 'x', 'Translate the phrase "hello world" into French.' to a language model. The model generates two responses: Response A ('y_A'), which is 'Bonjour le monde', and Response B ('y_B'), which is 'Salut monde'. A human evaluator indicates that Response A is a better translation than Response B. Which of the following expressions correctly represents the probability of this specific preference, given the user's prompt?
Modeling Pairwise Preference Probability with a Reward Function
Interpreting Preference Probability Notation

Learn Before

Related

Learn After