Case Study

Evaluating Chain-of-Thought Demonstrations

You are designing a prompt to teach a language model how to solve multi-step arithmetic problems. Below are two demonstrations for the same problem. Which demonstration (A or B) is a more effective example for this purpose? Justify your choice by explaining what makes it a better template for the language model to learn from.

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models Course

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science