Learn Before
In a system designed for automated self-refinement, the same model that generates the initial output must also be used to generate the feedback for that output.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Reward Models as an Example of Automated Feedback
Using LLMs for Feedback Generation
Feedback System Design for an AI Startup
A company is developing a system to iteratively improve the quality of its primary model's text summaries. They are considering using a separate, automated feedback model to score the summaries instead of relying on human reviewers. Which of the following represents the most significant trade-off the company must consider when choosing the automated approach?
In a system designed for automated self-refinement, the same model that generates the initial output must also be used to generate the feedback for that output.