Learn Before
Evaluating Prompt Demonstration Quality
A developer is creating a prompt to classify customer reviews into 'Positive', 'Negative', or 'Neutral' categories. Analyze the set of demonstrations provided in the prompt below. What is the primary weakness of this set of demonstrations, and why could it lead to poor performance when the model is used on a diverse set of new, unseen reviews?
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Evaluating Prompt Demonstration Quality
A developer is creating a prompt to solve multi-step math word problems. The prompt includes several examples of problems and their final answers. However, the model frequently makes logical errors on new, unseen problems. Based on principles for optimizing in-context examples, what is the most likely flaw in the prompt's design?
Improving Demonstrations for Logical Reasoning