Example of a Human Preference Ranking in RLHF
In the data annotation stage of RLHF, human evaluators rank multiple model-generated outputs for a given prompt. For example, if four outputs (y₁, y₂, y₃, y₄) are presented, an annotator's preference might be expressed with the ranking y₂ ≻ y₃ ≻ y₁ ≻ y₄. This indicates that y₂ is the most preferred response, followed by y₃ and y₁, with y₄ being the least preferred.
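A ranking like this implicitly encodes every pairwise preference between the responses. A minimal sketch (the labels y1–y4 are illustrative placeholders, not tied to any particular model output) of expanding a ranked list into its implied (winner, loser) pairs:

```python
# A preference ranking over four responses, most to least preferred.
# Labels are illustrative placeholders for model-generated outputs.
ranking = ["y2", "y3", "y1", "y4"]  # i.e. y2 ≻ y3 ≻ y1 ≻ y4

# Every response earlier in the list is preferred over every later one,
# so a ranking of n items implies n*(n-1)/2 pairwise comparisons.
pairs = [(ranking[i], ranking[j])
         for i in range(len(ranking))
         for j in range(i + 1, len(ranking))]

print(pairs)
# → [('y2', 'y3'), ('y2', 'y1'), ('y2', 'y4'),
#    ('y3', 'y1'), ('y3', 'y4'), ('y1', 'y4')]
```

This expansion is what lets a single listwise annotation be consumed by pairwise reward-model losses, as in the accumulated-pairwise-comparison formulation mentioned below.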

Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models Course
Related
Example of a Human Preference Ranking in RLHF
Listwise Loss from Accumulated Pairwise Comparisons
Plackett-Luce Model for Listwise Ranking
Example of Listwise Ranking in RLHF
A team is developing a language model to generate compelling short story endings. To gather human feedback, they generate four different endings for each story prompt. They are considering two feedback collection strategies:
Strategy 1: Human annotators are shown all four endings at once and asked to order them from best to worst.
Strategy 2: Human annotators are shown each of the four endings one at a time and asked to rate its quality on a scale of 1 to 10.
Based on the goal of collecting the most reliable data for model improvement, which strategy is generally more effective and why?
Improving Feedback Collection for a Chatbot
When using a listwise ranking approach to collect human feedback for a language model, the primary task for an annotator is to order all of the model's generated outputs from best to worst in a single pass, rather than assigning each output an independent numerical quality score (e.g., 1 to 10).
Example of a Human Preference Ranking in RLHF
Ranked Preference Notation
Example of Listwise Ranking in RLHF
A language model generates two different summaries for a given article: Summary 1 and Summary 2. A human evaluator is tasked with reviewing them and determines that Summary 1 is more coherent and factually accurate than Summary 2. How would this specific judgment be formally expressed using standard preference notation?
A human annotator provides the following judgments for four text completions (C1, C2, C3, C4) generated in response to a single prompt: C1 ≻ C4, C4 ≻ C2, and C2 ≻ C3. Based on this information, arrange the completions in order from most preferred to least preferred.
Limitations of Preference Notation
Learn After
A team is refining a language model using human feedback. For a specific user prompt, the model generated four different responses (labeled y₁, y₂, y₃, and y₄). A human annotator provided the following preference ranking for these responses: y₃ ≻ y₁ ≻ y₄ ≻ y₂. Based on this feedback, which response should the team identify as the second-most preferred?
Deconstructing a Preference Ranking
A human evaluator is comparing four responses (labeled y₁, y₂, y₃, and y₄) generated by a language model. They provide the following individual judgments:
- Response y₂ is preferred over response y₄.
- Response y₁ is preferred over response y₃.
- Response y₄ is preferred over response y₁.
Based on these judgments, arrange the four responses in a single list from most preferred to least preferred.
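Merging individual pairwise judgments like the ones above into a single ranking is, in effect, a topological sort over the preference graph. A minimal sketch, assuming the judgments are consistent (no preference cycles); the labels match the example above:

```python
from graphlib import TopologicalSorter

def total_order(pairs):
    """Merge pairwise preferences, given as (winner, loser) tuples,
    into one ranking from most to least preferred. Assumes the
    comparisons are consistent (acyclic)."""
    # graphlib expects node -> set of predecessors, so for each
    # judgment the winner becomes a predecessor of the loser.
    graph = {}
    for winner, loser in pairs:
        graph.setdefault(loser, set()).add(winner)
        graph.setdefault(winner, set())
    # Nodes with no predecessors are emitted first, so the most
    # preferred response comes out at the head of the list.
    return list(TopologicalSorter(graph).static_order())

# Judgments from the example above: y2 ≻ y4, y1 ≻ y3, y4 ≻ y1
print(total_order([("y2", "y4"), ("y1", "y3"), ("y4", "y1")]))
# → ['y2', 'y4', 'y1', 'y3']
```

Note that if the judgments leave some pair uncompared, the resulting order is only one valid arrangement; a unique total order requires enough comparisons to connect every response in a single chain.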