1Cademy - Plackett-Luce Loss Formula

Learn Before

Plackett-Luce Loss Function

Formula

Plackett-Luce Loss Formula

Given the log-probability $\log \Pr(\mathring{Y} | \mathbf{x})$ of a ground-truth ordered list $\mathring{Y}$ conditioned on an input $\mathbf{x}$, the loss function based on the Plackett-Luce model is defined as the expected negative log-probability over the preference dataset $\mathcal{D}_r$. The formula is: $\mathcal{L}_{\mathrm{pl}} = -\mathbb{E}_{(\mathbf{x},\mathring{Y}) \sim \mathcal{D}_r} \big[ \log \Pr(\mathring{Y} | \mathbf{x}) \big]$
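
A minimal sketch of how this loss might be computed in practice. It assumes the standard Plackett-Luce factorization, in which each item has a scalar score and the item at rank $k$ is chosen with probability proportional to the exponential of its score among the items not yet placed; this page does not spell out that factorization, so treat it as an assumption. The function names `pl_log_prob` and `pl_loss` are hypothetical, and the expectation over $\mathcal{D}_r$ is approximated by a plain average over a list of examples.

```python
import math

def pl_log_prob(scores_in_rank_order):
    """Log-probability of one ranking under the (assumed) Plackett-Luce form.

    scores_in_rank_order[k] is the model's score for the item that the
    ground-truth list places at rank k (best first).
    """
    log_prob = 0.0
    for k in range(len(scores_in_rank_order)):
        remaining = scores_in_rank_order[k:]
        # Numerically stable log-sum-exp over the items not yet placed.
        m = max(remaining)
        log_z = m + math.log(sum(math.exp(s - m) for s in remaining))
        log_prob += scores_in_rank_order[k] - log_z
    return log_prob

def pl_loss(dataset):
    """Average negative log-probability over a list of ranked score lists."""
    return -sum(pl_log_prob(scores) for scores in dataset) / len(dataset)

# Usage: two examples, each a list of scores already sorted by true rank.
loss = pl_loss([[2.0, 1.0, 0.0], [0.5, 0.5]])
```

With only two items per list, this expression reduces to the negative log of a sigmoid of the score difference, which is a quick sanity check on the implementation.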

Updated 2026-05-02

References

Reference of Foundations of Large Language Models Course

Learn After

Analysis of Ranking Error Penalties
A language model is being trained on a preference dataset. For a single input prompt, the ground-truth ranked sequence of responses is Y. The model calculates the probability of observing this exact sequence as Pr(Y|x) = 0.25. Based on the formula for the objective function that maximizes the likelihood of the model predicting the correct rankings, what is the loss value for this single data point?
Model Performance Evaluation using Plackett-Luce Loss
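
As a worked illustration of the single-data-point question listed above (the one with $\Pr(Y|x) = 0.25$), and assuming the logarithm in the loss is the natural logarithm, the expectation collapses to one term:

$$\mathcal{L}_{\mathrm{pl}} = -\log \Pr(Y | \mathbf{x}) = -\log 0.25 = \log 4 \approx 1.386$$

If a base-2 logarithm were used instead, the same data point would give a loss of exactly 2.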
