Sequential Scaling
Sequential scaling, also referred to as self-refinement, builds a sequence of solutions incrementally. It starts with an initial solution generated by a large language model (LLM). A verifier (often the same model) then evaluates the solution in a critique stage to produce feedback, such as textual critiques, numerical scores, or revised plans. In the refine stage, the model uses the original problem, the current solution, and this feedback to generate a potentially improved solution. This critique-refine cycle repeats iteratively, allowing the verifier to actively guide the generation process instead of simply selecting the best outcome from a static set of candidates.
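The critique-refine cycle above can be sketched as a minimal loop. The `generate`, `critique`, and `refine` functions below are hypothetical stand-ins for LLM calls (here, toy deterministic functions on a numeric problem chosen so the loop actually runs), not any real model API:

```python
# Toy sketch of sequential scaling (self-refinement), assuming stand-in
# functions in place of LLM calls. The example "problem" is: find x with
# x * x = target, so the verifier's feedback is a numeric error signal.

def generate(problem):
    # Initial solution: a rough first draft (an LLM's first attempt).
    return 1.0

def critique(problem, solution):
    # Critique stage: the verifier scores the solution and emits feedback.
    error = solution * solution - problem["target"]
    return {"score": -abs(error), "feedback": error}

def refine(problem, solution, feedback):
    # Refine stage: use the problem, current solution, and feedback to
    # produce an improved solution (here, one Newton-style correction).
    return solution - feedback["feedback"] / (2 * solution)

def sequential_scaling(problem, rounds=6):
    solution = generate(problem)
    for _ in range(rounds):
        fb = critique(problem, solution)
        if fb["score"] > -1e-9:   # solution judged good enough: stop refining
            break
        solution = refine(problem, solution, fb)
    return solution

print(round(sequential_scaling({"target": 2.0}), 6))  # prints 1.414214
```

The point of the sketch is the control flow, not the toy math: each pass through the loop is one critique-refine round, and the verifier's score both guides the next draft and decides when to stop.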
Tags
Ch.5 Inference - Foundations of Large Language Models
Computing Sciences
Related
Verifiers in LLM Reasoning
The Predict-then-Refine Paradigm in NLP
Self-Refinement in LLMs
Generating and Verifying Thinking Paths
Solution Selection as a Search Problem
Reasoning Path in Problem Solving
Best-of-N Sampling (Parallel Scaling)
Comparison of Parallel Scaling and Self-Refinement
Verifier
Solution as a Sequence of Reasoning Steps
A team is developing a system to solve complex mathematical word problems using a large language model. Their goal is to maximize the final answer's accuracy. Which of the following strategies best exemplifies a process where multiple potential solutions are first generated and then evaluated to select the most reliable one?
Analyzing LLM Reasoning Strategies
A system is designed to solve a complex problem by first generating multiple possible answers and then selecting the best one. Arrange the following steps to accurately represent this two-stage workflow.
In a system designed to solve a problem by first generating multiple potential solutions and then using a separate component to select the best one, the quality of the final selected answer depends solely on the generative capability of the initial model.
You are reviewing a proposed architecture for an i...
You’re designing an internal LLM assistant for a f...
You’re leading an internal rollout of an LLM assis...
In an LLM-based customer support assistant, the mo...
Design Review: Combining Tool Use, DTG, and Predict-then-Verify for a High-Stakes API Workflow
Designing a Reliable LLM Workflow for Real-Time Decisions
Post-Incident Analysis: Preventing Confidently Wrong API-Backed Answers
Case Study: Shipping a Tool-Using LLM Assistant with Built-In Verification Under Latency Constraints
Case Review: Preventing Incorrect Refund Commitments in an LLM + Payments API Assistant
Case Study: Preventing Hallucinated Compliance Claims in an API-Enabled LLM for Vendor Risk Reviews
Brill's Tagger as an Early Example of Predict-then-Refine
Modern NLP Applications of the Predict-then-Refine Paradigm
Self-Refinement in LLMs
An AI-powered code completion tool is designed to help developers write functions. When a developer provides a function name and a comment describing its purpose, the tool first generates a complete, functional block of code. Following this initial generation, the tool enters a loop where it analyzes the code it just wrote, identifies potential inefficiencies or non-standard practices, and applies a specific correction. This analysis-and-correction loop repeats several times, with the code block being progressively improved at each step. Which statement accurately characterizes the fundamental approach this tool uses?
Distinguishing NLP System Architectures
Analyzing System Architectures for Output Generation
Learn After
Critique-Refine Cycle in Sequential Scaling
A team is using a language model to draft a complex legal contract. They are considering two different approaches:
Approach 1: The model generates ten distinct, complete contract drafts based on the initial prompt. The team then reviews all ten drafts and selects the one that is most legally sound and best meets their needs.
Approach 2: The model generates a single initial contract draft. The model is then prompted to act as a legal expert, identify a potential loophole in that draft, and provide a specific critique. Finally, the model uses the original prompt, the first draft, and its own critique to generate a revised, improved second draft.
Which statement best analyzes the fundamental difference between these two approaches?
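The contrast in this scenario can be illustrated in miniature. The snippet below is an assumption-laden toy (drafts are plain numbers and the "verifier" is a hand-written scoring function, not a model): Approach 1 evaluates a static set of independent drafts and picks the best, while Approach 2 uses each critique to produce the next draft:

```python
# Toy contrast between the two approaches; all names and values are
# illustrative. A "draft" is a number, and the ideal draft is 10.0.

def score(draft):
    # Stand-in verifier: higher is better.
    return -abs(draft - 10.0)

def best_of_n(drafts):
    # Approach 1 (parallel scaling): generate N complete drafts
    # independently, then select the highest-scoring one.
    return max(drafts, key=score)

def self_refine(draft, rounds=3):
    # Approach 2 (sequential scaling): one draft, progressively revised;
    # each critique feeds directly into the next revision.
    for _ in range(rounds):
        feedback = 10.0 - draft          # critique: how far off, and which way
        draft = draft + 0.5 * feedback   # revise using that feedback
    return draft

print(best_of_n([4.0, 7.0, 12.0]))   # prints 12.0 (best of a fixed set)
print(round(self_refine(4.0), 2))    # prints 9.25 (one draft, improved in place)
```

The structural difference is visible in the code: `best_of_n` never feeds the verifier's judgment back into generation, whereas `self_refine` cannot produce its next draft without it.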
A developer is using an iterative method to improve a piece of code generated by a language model. Arrange the following actions into the correct sequence that represents one full cycle of this improvement process.
Optimizing an LLM-Powered Content Creation Workflow