Learn Before
Comparing Data Structures for Reasoning Fine-Tuning
An AI development team is creating a dataset to improve a language model's ability to solve complex physics word problems. They are considering two approaches for structuring their data:
- Approach A: Each data point consists of the word problem (input) and only the final numerical answer (output).
- Approach B: Each data point consists of the word problem (input) and a detailed, step-by-step derivation of the solution, including the formulas used, intermediate calculations, and the final answer (output).
Which approach is more effective for teaching the model to reason through new, unseen problems? Justify your answer by explaining the underlying learning mechanism for the model in each case.
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A development team wants to improve a language model's ability to solve multi-step logic puzzles. They plan to gather a large dataset of puzzles, where each entry consists only of the puzzle's description and its final, correct answer. They will then use this dataset to further train the model. Which statement provides the most accurate critique of this training strategy for its intended purpose?
Designing a Fine-Tuning Dataset for an AI Tutor
Comparing Data Structures for Reasoning Fine-Tuning