Learn Before
Short Answer

Applying the Start of Sequence Token

A language model calculates the probability of the sentence 'The cat sat' as the product of conditional probabilities: P('The') * P('cat' | 'The') * P('sat' | 'The cat'). This formulation is incomplete because it lacks a starting context for the first word. Rewrite this probability calculation to correctly incorporate a special token, denoted as <SOS>, that signals the beginning of the sequence.

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science