1Cademy - Comparing Outcome-Based and Process-Based Evaluations of Math Responses

Learn Before

Comparison of Process and Outcome Reward Models

Example

Comparing Outcome-Based and Process-Based Evaluations of Math Responses

When evaluating an AI's responses to a math problem, an outcome-based approach treats any response with the correct final result as entirely correct, ignoring any flaws in the intermediate reasoning. In contrast, a process-based approach assesses the correctness of each individual step, allowing it to identify and account for mistakes made during the reasoning process even if the final answer is coincidentally correct. This detailed step-level evaluation is essential for effectively guiding the model's logic through reward modeling.

Updated 2026-05-03

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn Before

Related