Multiple Choice

A researcher wants to evaluate a language model's ability to perform multi-step mathematical reasoning, where the model must execute a sequence of different calculations in the correct order to arrive at a final answer. Which of the following prompts is most effective for this specific evaluation goal?

Updated 2025-09-26

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science