Example

Example of Ranking LLM Outputs

To illustrate the process of ranking Large Language Model generations, suppose a model produces four different responses regarding environmental sustainability, denoted as y1\mathbf{y}_1 through y4\mathbf{y}_4. For instance, y1\mathbf{y}_1 suggests using electric vehicles, y2\mathbf{y}_2 advocates minimalism, y3\mathbf{y}_3 proposes going off-grid, and y4\mathbf{y}_4 recommends supporting local farms. Rather than assigning an independent numerical score to each option, human annotators compare the set and arrange them by preference. A possible ranking outcome might be y1y4y2y3\mathbf{y}_1 \succ \mathbf{y}_4 \succ \mathbf{y}_2 \succ \mathbf{y}_3, demonstrating that y1\mathbf{y}_1 is the most favored response while y3\mathbf{y}_3 is the least favored.

0

1

Updated 2026-04-20

Contributors are:

Who are from:

Tags

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course