Evaluating Chain-of-Thought Demonstrations
You are designing a prompt to teach a language model how to solve multi-step arithmetic problems. Below are two demonstrations for the same problem. Which demonstration (A or B) is a more effective example for this purpose? Justify your choice by explaining what makes it a better template for the language model to learn from.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models Course
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Step 1: Calculate Boris's Final Apple Count (Boris and Beck's Apples Problem)
An arithmetic word problem about Boris and Beck's apples is used as a demonstration within a prompt for a language model. The demonstration includes the problem statement, a sequence of intermediate reasoning steps, and the final answer. What is the primary purpose of including the 'intermediate reasoning steps' in this context?
Evaluating Chain-of-Thought Demonstrations
Calculation Annotation in CoT Demonstrations
Final Answer Token in CoT Demonstrations
An arithmetic word problem is used to demonstrate a step-by-step reasoning process. The problem is: 'Boris starts with 100 apples. Beck has 23 fewer apples than Boris. Boris then gives 10 apples to Beck. How many more apples does Boris have than Beck in the end?' Arrange the following reasoning steps into the correct logical order to solve the problem.