Multiple Choice

An engineer is building a system to generate a single quality score for a model's text response based on an initial prompt. Their proposed process is as follows:

  1. Concatenate the prompt tokens and the response tokens into a single sequence.
  2. Feed this combined sequence into a language model to get a final-layer hidden state vector for every token.
  3. Average all of these hidden state vectors to create a single representative vector.
  4. Pass this single vector through a linear layer to produce the final scalar score.

Which statement best identifies a critical flaw in this proposed method for this specific task?

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science