1Cademy - A team is developing a model to automatically assign a quality score to an AI-generated response. To do this, the model must be given some text as input. Which of the following best explains why the model should be given the original prompt concatenated with the AIs response, instead of just the AIs response alone?

Learn Before

Input Formulation for the RLHF Reward Model

Multiple Choice

A team is developing a model to automatically assign a quality score to an AI-generated response. To do this, the model must be given some text as input. Which of the following best explains why the model should be given the original prompt concatenated with the AI's response, instead of just the AI's response alone?

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related