Applying the Listwise Loss Summation
A human annotator is given a prompt and three possible responses: , , and . The annotator provides a ranked list where the preference order is . Based on the summation part of the listwise loss formula, , list all the individual log probability terms that would be summed up for this specific ranked list .
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Consider the following formula for a loss function used to train a model on ranked lists of outputs, where
Nis the number of items in a given listY:What is the primary analytical consequence of including the normalization term in this calculation?
Applying the Listwise Loss Summation
Consider the listwise loss formula used for training on ranked preferences:
True or False: If a model is completely uncertain about the preferences within a ranked list (i.e., it assigns for all distinct pairs), the contribution of that specific list to the overall loss will be zero.