Multiple Choice

A research team is using a large language model to automatically generate preference labels for pairs of responses to user queries. They observe that for queries requiring nuanced reasoning, the model's preference labels are inconsistent and often seem arbitrary. Which of the following prompt engineering strategies would be most effective at improving the consistency and quality of the preference labels in this scenario?

Updated 2025-10-02

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science