Concept

Evaluation Criteria for Pairwise Comparison in RLHF

When human experts perform pairwise comparisons in RLHF, they evaluate the two presented outputs based on specific criteria. These criteria often include the clarity, relevance, and accuracy of the responses, guiding their decision on which output is preferable.

0

1

Updated 2025-10-10

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences