Case Study

Improving AI Reasoning with Better Prompts

A developer is trying to build a system that uses a large language model to solve simple logical ordering problems. To guide the model, they provide the following example in the prompt:

Question: Sarah is older than Michael. Michael is older than David. Who is the youngest? Answer: David.

The developer finds that the model performs poorly when given new, slightly different problems (e.g., 'Who is the oldest?').

Evaluate the developer's example prompt. Explain the fundamental flaw in its design and rewrite the example to include the necessary components that will more effectively teach the model how to solve this class of problem.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science