Learn Before
Formula for the Critique-Refine Cycle
The critique-refine cycle can be mathematically expressed as: . In this formula, represents the improved solution. It is generated by the function, which prompts a Large Language Model with three inputs: the original problem , the previous solution , and the feedback on that solution, denoted as . This feedback is provided by a verifier.

0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Feedback Mechanisms in the Critique Stage
Formula for the Critique-Refine Cycle
Termination Conditions for the Critique-Refine Cycle
An AI system is tasked with generating a Python function to calculate the factorial of a number. It produces an initial version of the code. A verifier then analyzes this code and provides the following feedback: 'The function fails for an input of 0.' To continue the iterative improvement process, what is the most effective next action?
Evaluating an Iterative Refinement Process
Forms of Verifier Feedback in Sequential Scaling
An AI system is engaged in an iterative process to generate a recipe for a vegan chocolate cake. Below are different elements from one cycle of this process. Match each element to its corresponding role within the improvement cycle.
Learn After
A user prompts a language model to write a short summary of a provided article. The model generates a one-paragraph summary. The user then provides feedback: 'This is too brief. Please expand on the key findings mentioned in the third section.' The model uses this information to generate a new, three-paragraph summary. In the context of the iterative improvement formula what does the term represent in this scenario?
The formula describes an iterative process for improving a solution. Based on this formula, place the following events in the correct logical order from start to finish.
Missing Component in an Improvement Loop