Short Answer

Comparing Data Structures for Reasoning Fine-Tuning

An AI development team is creating a dataset to improve a language model's ability to solve complex physics word problems. They are considering two approaches for structuring their data:

  • Approach A: Each data point consists of the word problem (input) and only the final numerical answer (output).
  • Approach B: Each data point consists of the word problem (input) and a detailed, step-by-step derivation of the solution, including the formulas used, intermediate calculations, and the final answer (output).
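For concreteness, the two record formats might look like the following minimal sketch. The example problem, the field names `input` and `output`, and the helper `target_word_count` are all hypothetical and are not taken from the team's actual dataset.

```python
# Hypothetical physics problem used only for illustration.
problem = (
    "A 2.0 kg cart starts from rest and is pushed with a constant net force "
    "of 6.0 N for 4.0 s. What is its final speed?"
)

# Approach A: the target contains only the final numerical answer.
record_a = {"input": problem, "output": "12.0 m/s"}

# Approach B: the target spells out the formulas, the intermediate
# calculations, and the final answer.
steps = [
    "Newton's second law: a = F / m = 6.0 N / 2.0 kg = 3.0 m/s^2",
    "Constant acceleration from rest: v = a * t = 3.0 m/s^2 * 4.0 s = 12.0 m/s",
    "Final answer: 12.0 m/s",
]
record_b = {"input": problem, "output": "\n".join(steps)}


def target_word_count(record):
    """Rough size of the supervised target, in whitespace-separated words."""
    return len(record["output"].split())
```

Both records share the same input. They differ only in how much of the derivation appears in the target text the model is trained to reproduce.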

Which approach is more effective for teaching the model to reason through new, unseen problems? Justify your answer by explaining the underlying learning mechanism for the model in each case.

Updated 2025-10-07

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science