Conditional Probability in Sequence-to-Sequence Generation
In sequence-to-sequence models, the probability of generating a specific output token is conditioned on both the entire input sequence and all previously generated output tokens. This is represented by the formula Pr(y_t | x, y_<t), where y_t is the current output token, x is the complete input sequence, and y_<t denotes the output tokens already generated. This conditional probability is the core calculation performed at each step of the auto-regressive generation process.
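To make this concrete, below is a minimal sketch of scoring a candidate output under this factorization. It assumes a hypothetical model object exposing a next_token_distribution(x, prefix) method that returns Pr(y_t | x, y_<t) for each candidate token; the interface is illustrative, not a real library API.

```python
import math

def sequence_log_probability(model, x, y):
    """Sum log Pr(y_t | x, y_<t) over every position t of the output y."""
    total = 0.0
    for t in range(len(y)):
        # Condition on the full input x and the tokens generated so far, y[:t].
        dist = model.next_token_distribution(x, y[:t])  # hypothetical method
        total += math.log(dist[y[t]])
    return total
```

Summing log-probabilities rather than multiplying raw probabilities avoids numerical underflow on long sequences.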
Related
Next-Token Probability Calculation in Autoregressive Decoders
Example of Autoregressive Generation and Log-Probability Calculation
An auto-regressive language model is generating text following the input 'The cat sat on the'. The model's objective is to find the output sequence with the highest total log-probability. It is considering two possible two-word continuations:
Path A: 'warm mat'
- log Pr('warm' | 'The cat sat on the') = -0.9
- log Pr('mat' | 'The cat sat on the warm') = -1.5
Path B: 'plush rug'
- log Pr('plush' | 'The cat sat on the') = -1.2
- log Pr('rug' | 'The cat sat on the plush') = -1.1
Based on the provided conditional log-probabilities, which path will the model choose and why?
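For a concrete check, the two totals can be summed directly from the numbers given above; this tiny snippet mirrors that arithmetic:

```python
path_a = -0.9 + (-1.5)  # 'warm mat':  total log-probability -2.4
path_b = -1.2 + (-1.1)  # 'plush rug': total log-probability -2.3

# Path B wins (-2.3 > -2.4), even though greedy decoding would have taken
# 'warm' (-0.9) over 'plush' (-1.2) at the first step.
best = max([('warm mat', path_a), ('plush rug', path_b)], key=lambda p: p[1])
print(best)
```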
Debugging a Generation Model's Choice
Greedy Decoding vs. Optimal Sequence Probability
Reconciling Training Log-Likelihood with Inference-Time Sequence Selection
Diagnosing a “High-Confidence Wrong Token” Bug in Autoregressive Scoring
Explaining a Counterintuitive Decoding Outcome Using Softmax, Next-Token Conditionals, and Sequence Log-Probability
Auditing a Candidate Completion Using Softmax Next-Token Probabilities and Autoregressive Log-Probability
Investigating a Production Scoring Bug: Softmax Normalization vs. Autoregressive Sequence Log-Probability
Root-Cause Analysis: Why a “More Likely” Token-by-Token Completion Loses on Total Sequence Score
Design a Correct Sequence-Scoring Function for Autoregressive LLM Outputs
Direct Computation of Output Sequence Log-Probability in LLMs
A language model is generating text and has so far produced the sequence 'The sky is'. The model now needs to calculate the likelihood of the next word being 'blue'. Which of the following mathematical expressions correctly represents the probability of the next word being 'blue', given the preceding words?
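As a sketch of where such a probability comes from, the snippet below softmax-normalizes a few made-up logits for the context 'The sky is'; the vocabulary and scores are invented purely for illustration:

```python
import math

def softmax(logits):
    """Turn raw logits into a normalized probability distribution."""
    m = max(logits.values())  # subtract the max for numerical stability
    exps = {tok: math.exp(v - m) for tok, v in logits.items()}
    z = sum(exps.values())
    return {tok: e / z for tok, e in exps.items()}

# Invented logits the model might assign after the context 'The sky is'.
logits = {'blue': 4.0, 'clear': 2.5, 'falling': 0.5}
probs = softmax(logits)
print(probs['blue'])  # Pr('blue' | 'The sky is') ≈ 0.80
```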
Notation for Machine Translation Probability
Formula for Re-weighting a Probability Distribution with a Reward Function
Applying Conditional Probability Notation in Text Summarization
Learn After
A language model generates an output sequence one token at a time, where each new token's probability depends on prior information. If the model has already produced the first three tokens of an output based on a given input sequence, which of the following best describes the complete set of information used to calculate the probability for the fourth token?
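In the notation introduced at the top of this page, that quantity is Pr(y_4 | x, y_1, y_2, y_3): the model conditions on the complete input sequence x together with the three output tokens it has already produced, and on nothing else.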
Analyzing Generation Processes
Analyzing a Translation Model's Error