Multiple Choice

A machine learning engineer describes their text classification pipeline: 1) An input text is formatted with a special token at the beginning. 2) The token sequence is converted into embeddings. 3) A model processes the embeddings and outputs a sequence of hidden state vectors, one for each input token. 4) The hidden state vectors for all tokens except the special first one are averaged together. 5) This averaged vector is passed to a prediction network to get the final class. Which step represents a deviation from the standard, illustrated procedure for this task?

0

1

Updated 2025-10-09

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science