Multiple Choice

A team is training a language model to solve complex, multi-step word problems. They observe that while the model frequently provides the correct final answer, its step-by-step explanation often contains logical fallacies or incorrect calculations that coincidentally cancel each other out. Which of the following training strategies would be most effective at correcting the model's flawed reasoning process, rather than just its final output?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science