1Cademy - A researcher wants to evaluate a language models ability to perform multi-step mathematical reasoning, where the model must execute a sequence of different calculations in the correct order to arrive at a final answer. Which of the following prompts is most effective for this specific evaluation goal?

Learn Before

Example of a Prompt for Calculating the Mean Square

Multiple Choice

A researcher wants to evaluate a language model's ability to perform multi-step mathematical reasoning, where the model must execute a sequence of different calculations in the correct order to arrive at a final answer. Which of the following prompts is most effective for this specific evaluation goal?

Updated 2025-09-26

Contributors are:

Who are from:

Learn Before

Related