1Cademy - Reward Model Implementation Debugging

Learn Before

Diagram of Reward Score Calculation using an LLM

Case Study

Reward Model Implementation Debugging

Based on the standard architecture for this type of system, what specific component is the engineer most likely missing in their implementation that would convert the final hidden state vector into a single scalar score, and what is its function?

Updated 2025-10-10

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

An engineer is building a system to generate a single quality score for a model's text response based on an initial prompt. Their proposed process is as follows:
1. Concatenate the prompt tokens and the response tokens into a single sequence.
2. Feed this combined sequence into a language model to get a final-layer hidden state vector for every token.
3. Average all of these hidden state vectors to create a single representative vector.
4. Pass this single vector through a linear layer to produ
You are tasked with designing a system that uses a language model to generate a single numerical score representing the quality of a given text response to a prompt. Arrange the following steps into the correct logical sequence for this process.
Reward Model Implementation Debugging

Learn Before

Related