Learn Before
Model Performance Evaluation using Plackett-Luce Loss
Based on the provided data and the objective of minimizing the negative expected log-probability of the ground-truth sequences, which model is performing better on this dataset? Justify your answer.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Analysis of Ranking Error Penalties
A language model is being trained on a preference dataset. For a single input prompt, the ground-truth ranked sequence of responses is
Y. The model calculates the probability of observing this exact sequence asPr(Y|x) = 0.25. Based on the formula for the objective function that maximizes the likelihood of the model predicting the correct rankings, what is the loss value for this single data point?Model Performance Evaluation using Plackett-Luce Loss