Multiple Choice

An engineer is implementing a reward model where the final scalar score r is computed from the last hidden state vector h_last using the formula r = h_last * W_r. If the hidden state vector h_last has dimensions of [1 x 4096], what must be the dimensions of the weight matrix W_r for the formula to produce a single scalar value?

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science