An AI development team is using an inference-time technique to improve the quality of its model's responses. The process involves generating multiple candidate responses and then using a separate system to evaluate and select the best one. Arrange the following steps of this process in the correct chronological order.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Comprehension in Revised Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Improving Chatbot Response Quality without Retraining
An AI development team is using an inference-time rescoring process to select the best summary for a news article. The model first generates three candidate summaries. A separate scoring system then evaluates each candidate and assigns a single quality score from 0.0 to 1.0, where a higher score indicates a better summary. Given the following scores, which summary will be selected as the final output?
An AI development team is using an inference-time technique to improve the quality of its model's responses. The process involves generating multiple candidate responses and then using a separate system to evaluate and select the best one. Arrange the following steps of this process in the correct chronological order.