Learning Analogy: Outcome vs. Process Feedback
The inadequacy of outcome-only feedback for complex tasks can be understood through an analogy with a student learning math. Informing the student only whether their final answer is correct or incorrect does not help them identify their mistakes. Effective learning requires process-based guidance, such as a step-by-step explanation of the solution, which clarifies the underlying logic and concepts.
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Learning Analogy: Outcome vs. Process Feedback
A research team is training a language model to act as a programming assistant that writes complex, multi-step code functions. The training method rewards the model only if the final generated code executes without errors and produces the correct output. Despite extensive training, the model frequently generates code that is logically flawed, even if it sometimes produces the correct final result for the training examples. Which of the following statements best analyzes the fundamental weakness of this training approach?
Diagnosing Training Flaws in a Math AI
Critique of AI Training Methodologies for Complex Tasks
Learn After
Critique of an AI Training Methodology
A team is training a robot to assemble a complex piece of furniture from a kit. The assembly requires 20 distinct steps. Which of the following training approaches would be most effective for teaching the robot to perform this task reliably and to correct its own mistakes in the future?
Explaining Feedback Types with an Analogy