Concept

Input Formatting with Separator Tokens

For tasks involving multiple text segments, a common input formatting strategy is to concatenate the sequences and insert a special separator token, such as [SEP], to mark the boundary between them. For instance, an input sequence $x_m$ and a target sequence $y_1, y_2, \dots, y_n$ can be combined into a single input stream for a model as: $x_m$ [SEP] $y_1 y_2 \dots y_n$ [SEP].
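
The concatenation described above can be sketched in a few lines of Python. This is a minimal illustration, not any particular tokenizer's API: the helper names `join_with_sep` and `split_on_sep` are hypothetical, tokens are plain strings, and the literal string "[SEP]" stands in for whatever separator a real vocabulary defines.

```python
SEP = "[SEP]"  # assumed separator string; real tokenizers define their own


def join_with_sep(source_tokens, target_tokens, sep=SEP):
    """Build one stream of the form: source [SEP] target [SEP]."""
    return list(source_tokens) + [sep] + list(target_tokens) + [sep]


def split_on_sep(tokens, sep=SEP):
    """Recover the segments by cutting the stream at each separator."""
    segments, current = [], []
    for tok in tokens:
        if tok == sep:
            segments.append(current)
            current = []
        else:
            current.append(tok)
    if current:  # keep trailing tokens if the stream lacks a final separator
        segments.append(current)
    return segments


stream = join_with_sep(["how", "are", "you"], ["fine", "thanks"])
print(" ".join(stream))      # how are you [SEP] fine thanks [SEP]
print(split_on_sep(stream))  # [['how', 'are', 'you'], ['fine', 'thanks']]
```

Because the separator is itself a token in the stream, the segment boundaries stay recoverable after concatenation, which is what lets a single sequence model tell the two segments apart.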

Updated 2026-01-15

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences