Learn Before
Challenge of Encoder Pre-training Evaluation
Evaluating encoder pre-training is not straightforward: an encoder is a real-valued function, and there is no naturally occurring, gold-standard data against which the quality of its output vectors can be directly measured.
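A minimal sketch of the problem, using a hypothetical toy encoder (a random embedding lookup stands in for a learned Transformer; the token IDs and dimensions are illustrative assumptions):

```python
import numpy as np

# Hypothetical toy encoder: maps each token ID to a real-valued vector
# via a random embedding matrix (an assumption for illustration only;
# real encoders are learned networks such as Transformers).
rng = np.random.default_rng(0)
vocab_size, hidden_dim = 10, 3
embedding = rng.normal(size=(vocab_size, hidden_dim))

token_ids = np.array([1, 4, 7, 2])   # e.g. ['The', 'cat', 'sat', '.']
vectors = embedding[token_ids]       # shape (4, 3): one vector per token

# The evaluation challenge: there is no gold-standard target matrix to
# compare `vectors` against, so no direct loss can be computed from
# naturally occurring data.
# gold = ???  -> unavailable; quality must instead be measured
#               indirectly, e.g. via a proxy training objective or
#               performance on downstream tasks.
print(vectors.shape)
```

Because no such `gold` target exists, pre-training quality is judged indirectly rather than by comparing output vectors to labeled data.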
Tags
Foundations of Large Language Models
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
A model processes the input sentence 'The cat sat.' which is broken down into a sequence of 4 tokens: ['The', 'cat', 'sat', '.']. If this model functions as a sequence encoder, what is the most accurate description of the output it generates?
Model Output for a Token-Level Task
A sequence encoder processes an input sequence of 10 tokens and produces a single, fixed-size vector that represents the entire sequence's meaning.
Probabilistic Model for Text Classification using an Encoder-Classifier Architecture
Challenge of Encoder Pre-training Evaluation
Encoder Pre-training Output Architecture