Formulating NLP Tasks as Sequence-to-Sequence Mappings using Start Symbols
Natural Language Processing tasks can be formulated as sequence-to-sequence mappings by employing specific start symbols to differentiate the input (source) from the output (target). For instance, a dedicated token (such as <s>) is conventionally used as the start symbol on the source side, while a different, unique token is used as the start symbol on the target side. This unified representation allows diverse problems to be expressed in exactly the same format.
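A minimal sketch of this formulation, assuming hypothetical start symbols "<s>" for the source side and "<t>" for the target side (the concrete token names are illustrative, not fixed by the text):

```python
# Illustrative start symbols; the actual token strings are assumptions.
SRC_START = "<s>"   # hypothetical start symbol for encoder (source) input
TGT_START = "<t>"   # hypothetical, distinct start symbol for decoder (target) input

def make_seq2seq_pair(source_tokens, target_tokens):
    """Prefix each side with its own start symbol so every task shares one format."""
    return [SRC_START] + source_tokens, [TGT_START] + target_tokens

# Diverse problems expressed in the exact same source -> target format:
translation = make_seq2seq_pair(
    ["Hello", "world"],       # source: English sentence
    ["Hallo", "Welt"],        # target: German translation
)
classification = make_seq2seq_pair(
    ["great", "movie", "!"],  # source: review text
    ["positive"],             # target: label rendered as a sequence
)

print(translation)
print(classification)
```

Because both tasks reduce to the same (source, target) token format, a single model and training loop can handle them without task-specific output heads.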
Tags
Foundations of Large Language Models
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Debugging Input Representation in a Sequence-to-Sequence Model
Example of a T5 Machine Translation Training Sample with Special Tokens
In designing a sequence-to-sequence model, an engineer decides to use one specific start symbol for all source sequences fed to the encoder and a different, unique start symbol for all target sequences fed to the decoder. Which statement best analyzes the primary benefit of this design choice?
In a sequence-to-sequence model, using a single, identical start symbol for both the source (encoder) and target (decoder) inputs would make it impossible for the model to distinguish between the two types of sequences and thus prevent it from learning the task.