Formula

Conditional Probability of the Next Token

The conditional probability of a token $x_i$ given all of its preceding context tokens $x_0, \ldots, x_{i-1}$ is a fundamental concept in language modeling. It is denoted $\Pr(x_i \mid x_0, \ldots, x_{i-1})$ and represents the likelihood that the specific token $x_i$ appears next in a sequence after the preceding tokens have been observed.
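
As a rough illustration of the definition, the sketch below estimates $\Pr(x_i \mid x_0, \ldots, x_{i-1})$ by relative frequency over a tiny hand-made corpus. The function name `next_token_distribution` and the toy corpus are hypothetical and chosen only for this example. A real language model would typically produce this distribution with a neural network rather than by counting, so this is a sketch of what the quantity means, not of how it is computed in practice.

```python
from collections import Counter


def next_token_distribution(corpus, context):
    """Estimate Pr(x_i | x_0, ..., x_{i-1}) by relative frequency.

    corpus  : list of token sequences (lists of strings); a toy stand-in
              for training data (hypothetical, for illustration only)
    context : the preceding tokens x_0, ..., x_{i-1}, as a list of strings
    """
    context = tuple(context)
    n = len(context)
    counts = Counter()
    for seq in corpus:
        # Only sequences that begin with the full context contribute,
        # because the condition covers every token from position 0 onward.
        if len(seq) > n and tuple(seq[:n]) == context:
            counts[seq[n]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}  # context never observed, so no estimate is available
    return {token: count / total for token, count in counts.items()}


corpus = [
    ["the", "cat", "sat"],
    ["the", "cat", "ran"],
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
]

# Distribution over the next token x_2 given the context x_0="the", x_1="cat".
dist = next_token_distribution(corpus, ["the", "cat"])
print(dist)
```

Here three sequences start with the context "the cat"; two continue with "sat" and one with "ran", so the estimated probabilities over the next token sum to 1 across the observed continuations.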

Updated 2026-05-02

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models