Multiple Choice

An AI development team uses a simple method to refine their prompt candidates: they evaluate all prompts on a small dataset, assign each a performance score, and then discard the bottom 80%, keeping only the top 20% for further testing. What is the most significant risk of relying solely on this approach?
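
The selection step described in the question can be sketched as follows. This is a minimal illustration, not the team's actual implementation: the function name `prune_prompts`, the `keep_fraction` parameter, and the example scores are all hypothetical, and it assumes each prompt has already been assigned a single numeric score on the small dataset.

```python
import math


def prune_prompts(scores, keep_fraction=0.2):
    """Return the names of the top `keep_fraction` of prompts by score.

    `scores` maps a prompt name to its score on the evaluation set
    (hypothetical structure, assumed for this sketch).
    """
    if not scores:
        return []
    # Rank prompts from highest to lowest score.
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    # Keep the top fraction, but never fewer than one candidate.
    n_keep = max(1, math.ceil(len(ranked) * keep_fraction))
    return [name for name, _ in ranked[:n_keep]]


# Made-up scores, for illustration only.
scores = {
    "prompt_a": 0.62,
    "prompt_b": 0.71,
    "prompt_c": 0.58,
    "prompt_d": 0.69,
    "prompt_e": 0.66,
}
survivors = prune_prompts(scores)  # one of five prompts is kept
```

Everything outside the returned list is discarded, so the decision rests entirely on the single score each prompt received on that small dataset.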

Updated 2025-10-07

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science