Learn Before
A team is training a language model to provide medical summaries for doctors. They find that using a single reward model trained on 'overall quality' produces outputs that are often either factually accurate but too brief, or comprehensive but containing minor inaccuracies. To address this trade-off and improve the model's reliability, which of the following approaches to designing the reward system is most likely to be successful?
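The trade-off described here is the motivation for replacing a single "overall quality" reward with multiple specialized reward models whose scores are combined. A minimal sketch of that idea, assuming hypothetical placeholder scorers (`factuality_reward`, `coverage_reward`) standing in for learned reward models, with an illustrative weighted combination and factuality floor:

```python
# Hypothetical sketch: combine specialized reward signals instead of one
# "overall quality" score. In practice each scorer would be a learned
# reward model; these string-matching versions are placeholders.

def factuality_reward(summary: str, source: str) -> float:
    # Placeholder: fraction of summary sentences found in the source.
    claims = [c.strip() for c in summary.split(".") if c.strip()]
    supported = sum(1 for c in claims if c in source)
    return supported / len(claims) if claims else 0.0

def coverage_reward(summary: str, key_points: list[str]) -> float:
    # Placeholder: fraction of required key points the summary mentions.
    hit = sum(1 for p in key_points if p in summary)
    return hit / len(key_points) if key_points else 0.0

def combined_reward(summary: str, source: str, key_points: list[str],
                    w_fact: float = 0.7, w_cov: float = 0.3,
                    fact_floor: float = 0.9) -> float:
    f = factuality_reward(summary, source)
    c = coverage_reward(summary, key_points)
    # Factuality acts as a hard constraint: summaries below the floor
    # earn no coverage credit, so comprehensiveness cannot be traded
    # against accuracy.
    if f < fact_floor:
        return w_fact * f
    return w_fact * f + w_cov * c
```

The key design choice (illustrative, not prescribed by the question) is the factuality floor: a simple weighted sum would still let a longer, slightly inaccurate summary outscore a shorter accurate one, which is exactly the failure mode the scenario describes.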
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Designing a Reward System for an AI Tutor
An e-commerce company is developing a customer service chatbot using multiple specialized reward models, each focused on a different aspect of response quality. Match each desired chatbot behavior with the specialized reward model best suited to evaluate it.