Example

Comparing Outcome-Based and Process-Based Evaluations of Math Responses

When evaluating an AI's response to a math problem, an outcome-based approach checks only the final result: any response with the correct final answer is treated as entirely correct, regardless of flaws in the intermediate reasoning. In contrast, a process-based approach assesses the correctness of each individual step, so it can identify and penalize mistakes in the reasoning even when the final answer is coincidentally correct. This step-level signal is what lets a reward model guide the model's reasoning rather than only its final answer.
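
The contrast can be made concrete with a small sketch. The names below (`Step`, `outcome_score`, `process_scores`, `first_error_index`) and the hand-assigned step labels are illustrative assumptions, not part of any particular reward-modeling library; in practice the per-step labels would come from human annotators or a trained verifier.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Step:
    text: str
    is_correct: bool  # hypothetical label, assigned by hand for this example


def outcome_score(final_answer: str, reference: str) -> float:
    """Outcome-based: 1.0 if the final answer matches the reference, else 0.0."""
    return 1.0 if final_answer.strip() == reference.strip() else 0.0


def process_scores(steps: List[Step]) -> List[float]:
    """Process-based: one score per reasoning step."""
    return [1.0 if step.is_correct else 0.0 for step in steps]


def first_error_index(steps: List[Step]) -> Optional[int]:
    """Index of the first incorrect step, or None if every step is correct."""
    for i, step in enumerate(steps):
        if not step.is_correct:
            return i
    return None


# Simplify 16/64. The response reaches the right answer by an invalid method.
steps = [
    Step("Start from the fraction 16/64.", True),
    Step("Cancel the digit 6 in the numerator and denominator to get 1/4.", False),
]

outcome = outcome_score("1/4", "1/4")   # the final answer matches the reference
per_step = process_scores(steps)        # the second step is flagged as wrong
error_at = first_error_index(steps)     # position of the first flawed step
```

Here the outcome-based score gives full credit because the final answer matches, while the process-based scores expose the invalid cancellation in the second step. A per-step list like `per_step` is the kind of signal a process-based reward model could be trained on, under the assumptions stated above.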

Updated 2026-05-03

Tags

Foundations of Large Language Models

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences