Predict-then-Verify Approaches in LLM Reasoning
The fundamental principle of the predict-then-verify approach is that, for a given input such as a math problem, a model generates multiple candidate output sequences (solutions). A separate verifier or selection mechanism then evaluates each candidate and selects the best one; this selection step can be framed as a search problem over the generated solutions. Best-of-N sampling is a canonical example of this method.
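The workflow above can be sketched in a few lines. This is a minimal illustration, not a real system: the candidate list stands in for N samples drawn from an LLM at nonzero temperature, and the verifier is a trivial symbolic checker (in practice it might be a learned reward model, a unit test, or a formal checker). All function names here are hypothetical.

```python
def generate_candidates(problem: str, n: int = 5) -> list[str]:
    """Stand-in for sampling n solutions from an LLM at temperature > 0.
    These hand-written candidates are illustrative, not real model output."""
    samples = ["406", "408", "418", "408", "four hundred"]
    return samples[:n]

def verify(problem: str, candidate: str) -> float:
    """Stand-in verifier. Here we check the arithmetic directly; in practice
    this could be a learned reward model or a symbolic consistency checker."""
    try:
        return 1.0 if int(candidate) == 17 * 24 else 0.0
    except ValueError:
        return 0.0  # unparseable answers score lowest

def best_of_n(problem: str, n: int = 5) -> str:
    """Best-of-N sampling: generate N candidates, then select the one the
    verifier scores highest -- the selection framed as a search step."""
    candidates = generate_candidates(problem, n)
    return max(candidates, key=lambda c: verify(problem, c))

print(best_of_n("What is 17 * 24?"))  # -> 408
```

Note that accuracy depends on both stages: the generator must produce at least one correct candidate among the N samples, and the verifier must rank it highest.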
