A team is building a system to evaluate text sequences. They use a model that processes text one token at a time from left to right, where the output for any given token is influenced only by the tokens that came before it. To obtain a single vector that represents an entire input sequence for scoring, which of the following strategies is most appropriate for this type of model?
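Because the model is causal (each token's output depends only on earlier tokens), only the hidden state at the final token has been influenced by the entire sequence, which makes last-token pooling the natural choice. A minimal NumPy sketch of that idea, with toy numbers standing in for real hidden states and assuming right-padded batches (all names here are illustrative):

```python
import numpy as np

# Hypothetical hidden states for a batch of 2 sequences (toy values).
# Shape: (batch, seq_len, hidden_dim). In a left-to-right causal model,
# only the final token's hidden state has "seen" every earlier token.
hidden = np.arange(2 * 4 * 3, dtype=float).reshape(2, 4, 3)

# Attention mask marking real tokens (1) vs padding (0); the second
# sequence is shorter and right-padded.
mask = np.array([[1, 1, 1, 1],
                 [1, 1, 0, 0]])

# Index of the last real (non-padding) token in each sequence.
last_idx = mask.sum(axis=1) - 1            # -> [3, 1]

# Gather that token's hidden state as the sequence-level vector.
pooled = hidden[np.arange(hidden.shape[0]), last_idx]

print(last_idx.tolist())   # [3, 1]
print(pooled.shape)        # (2, 3)
```

A scoring head (e.g. a linear layer producing a scalar reward) would then be applied to `pooled`. Mean pooling over all positions is also seen in practice, but for a strictly causal model the last token is the only position whose representation conditions on the full input.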
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Final Reward Score Calculation in RLHF
Reward Model Implementation Analysis
Critique of a Sequence Representation Method