1Cademy - From Representation to Reward

Learn Before

Final Reward Score Calculation in RLHF

Short Answer

From Representation to Reward

In a reward model, a network first produces a high-dimensional vector that represents the combined meaning of a prompt and its response. This vector is then passed through a final output layer to produce the reward score. Analyze the fundamental difference between the information encoded in the high-dimensional vector and the information represented by the final single numerical score. What is the purpose of this transformation?

Updated 2025-10-10

Contributors are:

Who are from:

Learn Before

Related