Formula

Plackett-Luce Selection Probability Formula

In the Plackett-Luce model, the probability of selecting a specific response y\mathbf{y} from a set of possible responses YY given an input x\mathbf{x}, is calculated by normalizing its "worth" value, α(y)\alpha(\mathbf{y}). The selection probability is the worth of the selected response divided by the sum of the worths of all possible responses: Pr(y is selectedx,Y)=α(y)yYα(y)=exp(r(x,y))yYexp(r(x,y))\Pr(\mathbf{y}\text{ is selected}|\mathbf{x},Y) = \frac{\alpha(\mathbf{y})}{\sum_{\mathbf{y}' \in Y} \alpha(\mathbf{y}')} = \frac{\exp(r(\mathbf{x},\mathbf{y}))}{\sum_{\mathbf{y}' \in Y} \exp(r(\mathbf{x},\mathbf{y}'))}

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related