Learn Before
An AI system generates four possible summaries for a user's request. A scoring mechanism then evaluates each summary for quality, assigning a numerical score where higher is better. Based on the scores below, which summary would be selected as the final output?
- Summary A: Score 0.85
- Summary B: Score -0.20
- Summary C: Score 1.50
- Summary D: Score 1.15
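The selection rule described in the question amounts to taking the argmax over the candidate scores. The following is a minimal sketch in Python using the four scores listed above; the `scores` dictionary and the `select_best` helper are hypothetical names introduced only for illustration, not part of any particular system.

```python
# Candidate scores copied from the question; higher is better.
scores = {
    "Summary A": 0.85,
    "Summary B": -0.20,
    "Summary C": 1.50,
    "Summary D": 1.15,
}


def select_best(scored):
    """Return the name of the candidate with the highest score.

    Hypothetical helper: `scored` maps candidate names to numeric scores.
    If several candidates tie, the first one in iteration order is returned.
    """
    return max(scored, key=scored.get)


if __name__ == "__main__":
    best = select_best(scores)
    print(best, scores[best])
```

Negative scores need no special handling here: only the relative ordering of the scores matters, not their sign or magnitude.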
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Best Candidate Selection via Maximum Reward Score in BoN Sampling
An AI system is designed to generate helpful and safe responses. For a given prompt, it first creates three distinct candidate responses. A secondary component then scores each candidate for helpfulness and safety, and the response with the highest score is selected as the final output. If the system ultimately produces a response that is factually incorrect and unhelpful, which of the following is the most likely point of failure in the process?
Consider a system that first generates a diverse set of potential answers to a prompt and then uses a separate scoring component to select the single best answer to show the user. In this system, the quality of the final, user-facing answer is determined exclusively by the quality of the initial set of potential answers.
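The generate-then-select pipeline in the two items above can be sketched with the scoring component passed in as a parameter, which makes it easy to see how the selected output changes when only the scorer changes. This is a toy illustration under stated assumptions: `best_of_n`, `accurate_scorer`, and `length_biased_scorer` are hypothetical names, and the hand-written scorers merely stand in for whatever learned scoring component a real system would use.

```python
from typing import Callable, Sequence


def best_of_n(candidates: Sequence[str], score: Callable[[str], float]) -> str:
    """Return the candidate that the given scorer rates highest."""
    return max(candidates, key=score)


# A fixed candidate set: one correct answer and one incorrect, wordier answer.
candidates = [
    "Paris is the capital of France.",
    "Lyon is the capital of France, as everyone definitely knows.",
]


def accurate_scorer(text: str) -> float:
    """Toy scorer (assumption): rewards the factually correct answer."""
    return 1.0 if text.startswith("Paris") else 0.0


def length_biased_scorer(text: str) -> float:
    """Toy flawed scorer (assumption): rewards longer text regardless of accuracy."""
    return float(len(text))


if __name__ == "__main__":
    print(best_of_n(candidates, accurate_scorer))
    print(best_of_n(candidates, length_biased_scorer))
```

The candidate set is identical in both calls, so any difference between the two selected outputs comes from the scorer alone.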