1Cademy - Formula for the Critique-Refine Cycle

Learn Before

Critique-Refine Cycle in Sequential Scaling

Formula

Formula for the Critique-Refine Cycle

The critique-refine cycle can be mathematically expressed as: $\mathbf{y}_{k+1} = \mathrm{Refine}(\mathbf{x}, \mathbf{y}_k, \mathrm{Feedback}(\mathbf{y}_k))$ . In this formula, $\mathbf{y}_{k+1}$ represents the improved solution. It is generated by the $\mathrm{Refine}(\cdot)$ function, which prompts a Large Language Model with three inputs: the original problem $\mathbf{x}$ , the previous solution $\mathbf{y}_k$ , and the feedback on that solution, denoted as $\mathrm{Feedback}(\mathbf{y}_k)$ . This feedback is provided by a verifier.

Updated 2026-05-06

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

A user prompts a language model to write a short summary of a provided article. The model generates a one-paragraph summary. The user then provides feedback: 'This is too brief. Please expand on the key findings mentioned in the third section.' The model uses this information to generate a new, three-paragraph summary. In the context of the iterative improvement formula $y_{k+1} = \text{Refine}(x, y_k, \text{Feedback}(y_k))$ what does the term $y_k$ represent in this scenario?
The formula $y_{k+1} = \text{Refine}(x, y_k, \text{Feedback}(y_k))$ describes an iterative process for improving a solution. Based on this formula, place the following events in the correct logical order from start to finish.
Missing Component in an Improvement Loop

Learn Before

Related

Learn After