Learn Before
Applying the Start of Sequence Token
A language model calculates the probability of the sentence 'The cat sat' as the product of conditional probabilities: P('The') * P('cat' | 'The') * P('sat' | 'The cat'). This formulation is incomplete because it lacks a starting context for the first word. Rewrite this probability calculation to correctly incorporate a special token, denoted as <SOS>, that signals the beginning of the sequence.
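The factorization above can be sketched in code. This is a minimal illustration with a hand-written toy probability table (the numbers are made up, not from any trained model): every word, including the first, is conditioned on a context that starts with the `<SOS>` token.

```python
# Toy conditional probability table: context tuple -> {next word: probability}.
# Values are illustrative placeholders, not learned parameters.
probs = {
    ("<SOS>",): {"The": 0.4},
    ("<SOS>", "The"): {"cat": 0.3},
    ("<SOS>", "The", "cat"): {"sat": 0.5},
}

def sentence_probability(words):
    """P(w_1..w_n) = product over i of P(w_i | <SOS>, w_1, ..., w_{i-1})."""
    context = ("<SOS>",)
    p = 1.0
    for w in words:
        p *= probs[context][w]   # even the first word has a context: <SOS>
        context = context + (w,)
    return p

print(sentence_probability(["The", "cat", "sat"]))  # 0.4 * 0.3 * 0.5 = 0.06
```

Note that without `<SOS>` the first factor would be an unconditional P('The'); prepending the token gives every word in the product the same conditional form.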
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is designed to calculate the probability of a sentence by multiplying the conditional probabilities of each word given the words that came before it. For the sentence 'The cat sat', this would be calculated as P('The') * P('cat' | 'The') * P('sat' | 'The cat'). What is the fundamental problem with calculating the probability of the very first word, 'The', in this specific manner?
Applying the Start of Sequence Token
A language model is tasked with calculating the probability of the sentence 'The quick brown fox'. Using the chain rule of probability and a special start-of-sentence token denoted as <s>, how would the model correctly formulate this calculation?