Calculation Annotation in CoT Demonstrations
In Chain-of-Thought prompting demonstrations, special characters like ≪ ... ≫ can be used to explicitly annotate detailed calculation steps. This formatting, referred to as calculation annotation, helps guide a language model to show its work clearly within the reasoning process.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Step 1: Calculate Boris's Final Apple Count (Boris and Beck's Apples Problem)
An arithmetic word problem about Boris and Beck's apples is used as a demonstration within a prompt for a language model. The demonstration includes the problem statement, a sequence of intermediate reasoning steps, and the final answer. What is the primary purpose of including the 'intermediate reasoning steps' in this context?
Evaluating Chain-of-Thought Demonstrations
Calculation Annotation in CoT Demonstrations
Final Answer Token in CoT Demonstrations
An arithmetic word problem is used to demonstrate a step-by-step reasoning process. The problem is: 'Boris starts with 100 apples. Beck has 23 fewer apples than Boris. Boris then gives 10 apples to Beck. How many more apples does Boris have than Beck in the end?' Arrange the following reasoning steps into the correct logical order to solve the problem.
Learn After
A prompt engineer is creating a few-shot example to teach a language model how to solve a word problem. The goal is to make the model's reasoning process, especially the arithmetic, as clear and explicit as possible for the model to learn from. Given the problem: 'A warehouse had 500 boxes. On Monday, they received a shipment of 150 new boxes. On Tuesday, they shipped out 75 boxes. How many boxes are left?', which of the following reasoning chains best achieves the goal by clearly demarcating the calculation steps?
Applying Calculation Annotation to a Reasoning Chain
A prompt engineer is crafting a few-shot demonstration to teach a language model how to solve multi-step financial problems. The goal is to create a clear, replicable reasoning pattern that the model can follow. Below are two versions of the reasoning chain for the same problem.
Problem: Calculate the total amount after 2 years if $1000 is invested at an annual interest rate of 5%, compounded annually.
Version A: The principal is $1000 and the rate is 5%. After the first year, the interest is 1000 * 0.05 = 50. The new principal is 1000 + 50 = 1050. After the second year, the interest is 1050 * 0.05 = 52.50. The final amount is 1050 + 52.50 = 1102.50. The final answer is $1102.50.
Version B: The principal is $1000 and the rate is 5%. For Year 1, we calculate the interest: ≪1000 * 0.05 = 50≫. The new principal is the original principal plus the interest: ≪1000 + 50 = 1050≫. For Year 2, we calculate the new interest: ≪1050 * 0.05 = 52.50≫. The final amount is the Year 1 principal plus the Year 2 interest: ≪1050 + 52.50 = 1102.50≫. The final answer is $1102.50.
Which version is a more effective demonstration for the language model, and why?