1Cademy - Learning Analogy: Outcome vs. Process Feedback

Learn Before

Insufficiency of Outcome-Based Rewards for Complex Reasoning

Example

Learning Analogy: Outcome vs. Process Feedback

The inadequacy of outcome-only feedback for complex tasks can be understood through an analogy with a student learning math. Telling the student only whether the final answer is right or wrong does not show them where their reasoning went wrong. Effective learning requires process-based guidance, such as a step-by-step walkthrough of the solution, which clarifies the underlying logic and concepts.
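The analogy can be made concrete with a small sketch. The code below is an illustrative toy, not a method taken from the course: the names `Step`, `outcome_feedback`, `process_feedback`, and `first_error` are hypothetical, and the "student" is a hand-written two-step solution to 3 * (4 + 5) with a slip in the addition. Outcome feedback yields a single right/wrong signal, while process feedback checks each step on its own and shows where the mistake occurred.

```python
import operator
from typing import Callable, List, NamedTuple, Optional


class Step(NamedTuple):
    """One line of a worked solution (hypothetical toy representation)."""
    op: Callable[[int, int], int]  # operation the student applied
    a: int                         # left operand
    b: int                         # right operand
    claimed: int                   # result the student wrote down


def outcome_feedback(steps: List[Step], expected_final: int) -> bool:
    """Outcome-only signal: is the last written result the right answer?"""
    return steps[-1].claimed == expected_final


def process_feedback(steps: List[Step]) -> List[bool]:
    """Process signal: is each step valid given its own operands?"""
    return [step.op(step.a, step.b) == step.claimed for step in steps]


def first_error(steps: List[Step]) -> Optional[int]:
    """Index of the first invalid step, or None if every step checks out."""
    for index, ok in enumerate(process_feedback(steps)):
        if not ok:
            return index
    return None


# Student's attempt at 3 * (4 + 5); the correct answer is 27.
student_solution = [
    Step(operator.add, 4, 5, 8),   # slip: 4 + 5 is 9, not 8
    Step(operator.mul, 3, 8, 24),  # valid on its own, but uses the wrong sum
]

outcome_ok = outcome_feedback(student_solution, expected_final=27)
step_flags = process_feedback(student_solution)
error_index = first_error(student_solution)

print("outcome correct:", outcome_ok)
print("per-step correct:", step_flags)
print("first faulty step:", error_index)
```

In this toy case the outcome signal only says the answer is wrong, whereas the per-step flags point at the addition as the source of the error and show that the multiplication itself was carried out correctly.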

Updated 2025-10-10

References

Reference of Foundations of Large Language Models Course

Learn After

Critique of an AI Training Methodology
A team is training a robot to assemble a complex piece of furniture from a kit. The assembly requires 20 distinct steps. Which of the following training approaches would be most effective for teaching the robot to perform this task reliably and to correct its own mistakes in the future?
Explaining Feedback Types with an Analogy
