1Cademy - A human evaluator is comparing pairs of AI-generated responses for two different user requests. Request 1 asks for a factual summary of a specific scientific process. Request 2 asks for a creative and engaging short story. How should the evaluators focus on different quality criteria shift between these two tasks?

Learn Before

Evaluation Criteria for Pairwise Comparison in RLHF

Multiple Choice

A human evaluator is comparing pairs of AI-generated responses for two different user requests. Request 1 asks for a factual summary of a specific scientific process. Request 2 asks for a creative and engaging short story. How should the evaluator's focus on different quality criteria shift between these two tasks?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related