Concept

Formulating NLP Tasks as Sequence-to-Sequence Mappings using Start Symbols

Natural language processing tasks can be formulated as sequence-to-sequence mappings by using distinct start symbols to mark the input (source) and the output (target). For instance, the token [CLS] is conventionally used as the start symbol on the source side, while ⟨s⟩ is used as the start symbol on the target side. This unified representation allows diverse problems, such as translation or classification, to be expressed in the same source-to-target format.
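As a rough illustration of the idea, the sketch below prefixes each side of a task with its start symbol. The helper name `to_seq2seq`, the ASCII stand-in `<s>` for ⟨s⟩, and the example task strings are all assumptions made for this sketch; they are not taken from the source.

```python
from typing import Tuple

SRC_START = "[CLS]"  # source-side start symbol, as named in the text
TGT_START = "<s>"    # ASCII stand-in (assumption) for the target-side symbol ⟨s⟩


def to_seq2seq(source_text: str, target_text: str) -> Tuple[str, str]:
    """Return (source, target) strings, each prefixed with its start symbol."""
    return f"{SRC_START} {source_text}", f"{TGT_START} {target_text}"


# Hypothetical tasks, written in the same source -> target format.
examples = [
    to_seq2seq("Translate English to German: hello", "hallo"),
    to_seq2seq("Classify sentiment: I love this film", "positive"),
]

for src, tgt in examples:
    print(src, "->", tgt)
```

Both a translation task and a classification task end up as the same kind of string pair, which is the point of the unified representation: a single sequence-to-sequence model can, in principle, be trained on either.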

Updated 2026-04-16

Tags

Foundations of Large Language Models

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences