Preference Notation in Human Feedback
In the context of human feedback for language models, the notation y₁ ≻ y₂ is used to formally represent a preference. It signifies that a human annotator has judged output y₁ to be of higher quality, or more desirable, than output y₂.
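In code, a single pairwise judgment can be stored as an ordered pair (winner, loser), and a set of such judgments can be combined into a full ranking via transitivity. The sketch below is a minimal illustration (the function name and data layout are hypothetical, not from the chapter):

```python
# A preference y1 ≻ y2 is stored as an ordered pair (winner, loser).
preferences = [("C1", "C4"), ("C4", "C2"), ("C2", "C3")]

def total_order(prefs):
    """Recover a full ranking consistent with all pairwise judgments,
    assuming the judgments are transitive and connect every item."""
    items = {x for pair in prefs for x in pair}
    beats = set(prefs)
    # Close under transitivity: if a ≻ b and b ≻ c, then a ≻ c.
    changed = True
    while changed:
        changed = False
        for (a, b) in list(beats):
            for (c, d) in list(beats):
                if b == c and (a, d) not in beats:
                    beats.add((a, d))
                    changed = True
    # Rank each item by how many others it beats (most preferred first).
    return sorted(items,
                  key=lambda x: sum(1 for (w, _) in beats if w == x),
                  reverse=True)

print(total_order(preferences))  # → ['C1', 'C4', 'C2', 'C3']
```

This mirrors the kind of judgment chain in the exercises below: from C1 ≻ C4, C4 ≻ C2, and C2 ≻ C3, the implied total order is C1 ≻ C4 ≻ C2 ≻ C3.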

Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Reward Model Learning in RLHF
Pairwise Comparison for Human Feedback in RLHF
Listwise Ranking for Human Feedback in RLHF
Pointwise Method (Rating) for Human Feedback in RLHF
Evaluating a Human Feedback Strategy
A research team is developing a system to improve a language model using feedback from a large, diverse group of non-expert annotators. The team's primary goal is to ensure the feedback data is as consistent and reliable as possible, even with minimal training for the annotators. Which of the following feedback collection strategies would best achieve this goal, and why?
Trade-offs in Human Feedback Collection Methods
Learn After
Example of a Human Preference Ranking in RLHF
Ranked Preference Notation
Example of Listwise Ranking in RLHF
A language model generates two different summaries for a given article: Summary 1 and Summary 2. A human evaluator is tasked with reviewing them and determines that Summary 1 is more coherent and factually accurate than Summary 2. How would this specific judgment be formally expressed using standard preference notation?
A human annotator provides the following judgments for four text completions (C1, C2, C3, C4) generated in response to a single prompt: C1 ≻ C4, C4 ≻ C2, and C2 ≻ C3. Based on this information, arrange the completions in order from most preferred to least preferred.
Limitations of Preference Notation