Essay

Analyzing Prompt Design for LLM Evaluation

Analyze why the prompt 'Please calculate the average of the numbers 2, 4, and 9' is considered a test of a Large Language Model's ability to perform a direct mathematical operation, rather than its complex reasoning skills. In your analysis, create a contrasting prompt that would be designed to test the model's step-by-step reasoning for the same calculation and explain why your new prompt is different.

0

1

Updated 2025-10-10

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science