Formula

Predictive Inference by Maximizing Conditional Probability

In probabilistic models, the goal is to determine the best output sequence, y^\hat{\mathbf{y}}, for a given input sequence, x\mathbf{x}. This is accomplished by identifying the possible output sequence, y\mathbf{y}, that has the highest conditional probability. The formal expression for this predictive rule is: y^=arg maxy Prw(yx)\hat{\mathbf{y}} = \underset{\mathbf{y}}{\text{arg max}} \ \text{Pr}^w(\mathbf{y}|\mathbf{x}) This equation states that the prediction y^\hat{\mathbf{y}} is the argument (arg max) that maximizes the conditional probability Pr(yx)\text{Pr}(\mathbf{y}|\mathbf{x}). The probability distribution is parameterized by the model's weights, denoted by the superscript w.

Image 0

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models