A team is training a language model using human feedback. For a given prompt, the model generates two distinct responses, Response A and Response B. A human evaluator indicates a preference for Response A over Response B. To learn from this feedback, the system uses a probabilistic model designed for pairwise comparisons to quantify this preference. Which statement best analyzes how this model represents the human's choice?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A team is training a language model using human feedback. For a given prompt, the model generates two distinct responses, Response A and Response B. A human evaluator indicates a preference for Response A over Response B. To learn from this feedback, the system uses a probabilistic model designed for pairwise comparisons to quantify this preference. Which statement best analyzes how this model represents the human's choice?
Interpreting Preference Data for AI Training
Justifying the Choice of a Preference Model
Derivation of the Bradley-Terry Preference Formula