1Cademy - Segment Embedding

Learn Before

Input Embedding Formula in BERT-like Models

Definition

Segment Embedding

A segment embedding, denoted $\mathbf{e}_{\mathrm{seg}}$, is an additional embedding that specifies whether a token belongs to the first sentence, $\mathrm{Sent}_{A}$, or the second sentence, $\mathrm{Sent}_{B}$. It is used alongside the regular token embedding and positional embedding to help the model distinguish between the two sentences.
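The definition above can be illustrated with a minimal sketch. It assumes the common BERT-style convention that [CLS] and the first [SEP] count as part of $\mathrm{Sent}_{A}$, and that the token, positional, and segment embeddings are combined by element-wise addition. The helper names (`segment_ids`, `input_embeddings`), the sentence pair, the embedding size, and the random tables are all hypothetical and chosen only for illustration.

```python
import random

def segment_ids(tokens, sep="[SEP]"):
    """Label each token 0 (Sent_A) or 1 (Sent_B).

    Assumption: [CLS] and the first [SEP] belong to Sent_A;
    every token after the first [SEP] belongs to Sent_B.
    """
    ids, current = [], 0
    for tok in tokens:
        ids.append(current)
        if tok == sep and current == 0:
            current = 1  # switch to Sent_B after the first separator
    return ids

def input_embeddings(tokens, tok_table, pos_table, seg_table):
    """Add token, positional, and segment vectors element-wise (assumed rule)."""
    ids = segment_ids(tokens)
    return [
        [t + p + s for t, p, s in zip(tok_table[tok], pos_table[i], seg_table[seg])]
        for i, (tok, seg) in enumerate(zip(tokens, ids))
    ]

# Hypothetical sentence pair and toy embedding tables (dimension 4).
random.seed(0)
dim = 4
tokens = "[CLS] it is raining [SEP] take an umbrella [SEP]".split()

def rand_vec():
    return [random.uniform(-1.0, 1.0) for _ in range(dim)]

tok_table = {tok: rand_vec() for tok in set(tokens)}
pos_table = [rand_vec() for _ in tokens]
seg_table = [rand_vec(), rand_vec()]  # index 0 -> Sent_A, index 1 -> Sent_B

ids = segment_ids(tokens)
embeddings = input_embeddings(tokens, tok_table, pos_table, seg_table)
print(ids)
```

In this sketch only two segment vectors exist, and every token in the same sentence shares the same one, so the segment embedding carries no information about the token itself, only about which sentence it came from.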

Updated 2026-06-27

References

Reference of Foundations of Large Language Models Course

Learn After

Example of Input Embedding Composition for a Sentence Pair
A model processes a two-sentence input: 'The sky is blue. [SEP] Grass is green.'. To help the model distinguish between the two sentences, it uses a specific vector, Vec_A, for the first sentence and another vector, Vec_B, for the second. How are these vectors assigned to the tokens in the combined input sequence?
Debugging a Sentence-Pair Model
A model is given a two-sentence input: 'What is the capital of France? [SEP] Paris is the capital.'. The model uses one vector representation for the first sentence (let's call it Vec_A) and a different one for the second sentence (Vec_B). For the tokenized sequence below, what is the correct sequence of these vector labels that would be assigned to each token?

[CLS] What is the capital of France ? [SEP] Paris is the capital . [SEP]

The correct sequence is: ____. (Use a comma and space t
Formula for Input Embedding Composition