1Cademy - Using Scoring Systems for Inference-Time Rescoring

Learn Before

Rescoring and Reranking for Inference-Time Alignment

Concept

Using Scoring Systems for Inference-Time Rescoring

This method involves employing a scoring system, which functions similarly to a reward model, to simulate human feedback on LLM outputs. The system assigns scores to different responses, allowing for the prioritization of those that receive more positive evaluations.

Updated 2025-10-07

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Improving Chatbot Response Quality without Retraining
An AI development team is using an inference-time rescoring process to select the best summary for a news article. The model first generates three candidate summaries. A separate scoring system then evaluates each candidate and assigns a single quality score from 0.0 to 1.0, where a higher score indicates a better summary. Given the following scores, which summary will be selected as the final output?
An AI development team is using an inference-time technique to improve the quality of its model's responses. The process involves generating multiple candidate responses and then using a separate system to evaluate and select the best one. Arrange the following steps of this process in the correct chronological order.

Learn Before

Related

Learn After