Concept

Using Scoring Systems for Inference-Time Rescoring

This method involves employing a scoring system, which functions similarly to a reward model, to simulate human feedback on LLM outputs. The system assigns scores to different responses, allowing for the prioritization of those that receive more positive evaluations.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences