Concept

Plackett-Luce Loss Function

The Plackett-Luce loss function is derived directly from the log-probability of a ground-truth ranked sequence. It is defined as the negative log-likelihood of observing this correct sequence, a formulation rooted in the principle of maximum likelihood estimation. The training objective is to minimize this loss, which effectively maximizes the model's probability of predicting the correct rankings. This loss is typically averaged over all samples in a dataset by taking the expectation of the negative log-probability.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences