A research team is creating instructions for human labelers who will be rating the quality of two different AI-generated responses to a user's query. The team wants to include an example in their instructions that not only shows a preference but also models a clear, step-by-step reasoning process to guide the labelers. Which of the following examples best accomplishes this goal?
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Diagnosing Inconsistent Preference Labeling
A research team is creating instructions for human labelers who will be rating the quality of two different AI-generated responses to a user's query. The team wants to include an example in their instructions that not only shows a preference but also models a clear, step-by-step reasoning process to guide the labelers. Which of the following examples best accomplishes this goal?
Improving a Preference Labeling Prompt with Chain-of-Thought