Formula

Mathematical Formulation of the Search Problem in LLM Inference

The search problem for Large Language Model (LLM) inference can be mathematically re-expressed as finding the optimal output sequence, y^\hat{\mathbf{y}}, from the entire search space Y\mathcal{Y} that maximizes the conditional probability given the input sequence x\mathbf{x}. This is formally defined by the equation: y^=arg maxyYPr(yx)\hat{\mathbf{y}} = \argmax_{\mathbf{y} \in \mathcal{Y}} \Pr(\mathbf{y}|\mathbf{x})

Image 0

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related