Formula

Preference Notation in Human Feedback

In the context of human feedback for language models, the notation $y_a \succ y_b$ is used to formally represent a preference. It signifies that a human annotator has judged output $y_a$ to be of higher quality or more desirable than output $y_b$.
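
As a rough illustration of how one such judgment might be stored, the sketch below records a single preference $y_a \succ y_b$ as a small Python record. The names `Preference` and `record_preference`, the `prompt` field, and the example strings are hypothetical and chosen only for this illustration; they are not taken from any particular library or dataset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Preference:
    """One human judgment of the form y_a ≻ y_b (hypothetical record layout)."""
    prompt: str     # assumed: the input both outputs respond to
    preferred: str  # y_a, the output the annotator judged better
    rejected: str   # y_b, the output the annotator judged worse


def record_preference(prompt: str, output_1: str, output_2: str,
                      annotator_picks_first: bool) -> Preference:
    """Order two candidate outputs so that `preferred` ≻ `rejected`."""
    if annotator_picks_first:
        return Preference(prompt, preferred=output_1, rejected=output_2)
    return Preference(prompt, preferred=output_2, rejected=output_1)


# Illustrative data only: the annotator prefers the first output.
example = record_preference(
    "Explain gravity in one sentence.",
    "Gravity is the attraction between masses.",
    "Gravity is a thing.",
    annotator_picks_first=True,
)
```

Here `example.preferred` plays the role of $y_a$ and `example.rejected` the role of $y_b$. Storing the pair in a fixed order means the direction of the preference is carried by the field names rather than by a separate label.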

Updated 2025-10-08

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences