Debugging an LLM's Arithmetic Process
A developer is testing a new AI model integrated with an external calculator. The system is designed to trigger the calculator only when it encounters a mathematical expression enclosed in ≪...≫ that also contains an equals sign (=) to signal the end of the expression to be evaluated. The developer gives the model a word problem, and it generates three different potential responses. Evaluate each response below and explain why it would or would not successfully trigger the external calculator according to the specified rule.
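The trigger rule can be made concrete with a short check. This is a minimal sketch, not the actual system: it assumes the rule means "a span delimited by ≪ and ≫ whose contents include an equals sign". The function name `calculator_triggers` and the sample strings are hypothetical, written for illustration only. They are not the three responses the exercise refers to.

```python
import re

# Assumed reading of the rule: the "=" must sit INSIDE the ≪...≫ delimiters.
# [^≪≫]* keeps the match from running past a closing delimiter.
TRIGGER = re.compile(r"≪([^≪≫]*=[^≪≫]*)≫")


def calculator_triggers(text: str) -> list[str]:
    """Return the contents of every ≪...≫ span that contains an equals sign."""
    return TRIGGER.findall(text)


# Hypothetical model outputs (illustrative only).
samples = [
    "She has ≪5 * 3 =≫ apples.",   # delimiters and "=" inside: triggers
    "She has ≪5 * 3≫ apples.",     # delimiters but no "=": does not trigger
    "She has 5 * 3 = 15 apples.",  # "=" but no delimiters: does not trigger
]

for sample in samples:
    print(bool(calculator_triggers(sample)), sample)
```

Under this reading, a response fails the rule if either element is missing or if the equals sign falls outside the delimiters, as in `≪5 * 3≫ = 15`.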
Tags
Ch.3 Prompting - Foundations of Large Language Models
Related
A large language model is configured to offload arithmetic to an external tool. The tool is activated only when it detects an expression enclosed in ≪...≫ that is immediately followed by an equals sign (=). Given this rule, analyze the following model outputs and identify which one would successfully trigger the external tool to perform the calculation.
Debugging an LLM's Arithmetic Process
Constructing a Calculation-Enabled Model Output