Formula

LLM Prediction with Full Context

The prediction of a Large Language Model, denoted $\hat{y}$, when provided with a full context $c$ and an input $z$, is the output $y$ that maximizes the conditional probability. This is formally expressed as:

$$\hat{y} = \underset{y}{\arg\max}\, \text{Pr}(y \mid c, z)$$

This prediction often serves as the target or 'gold standard' when learning a compressed representation of the context.
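As a rough illustration of the arg max above, the sketch below picks the highest-probability output from a small, explicitly enumerated candidate set. The names `predict` and `toy_log_prob`, the candidate list, and the probability table are all hypothetical and stand in for a real model; an actual LLM cannot enumerate every possible $y$ and instead approximates the arg max with a decoding procedure such as greedy or beam search.

```python
import math

def predict(log_prob, candidates, c, z):
    """Return the candidate y with the highest log Pr(y | c, z)."""
    return max(candidates, key=lambda y: log_prob(y, c, z))

def toy_log_prob(y, c, z):
    """Hypothetical stand-in for a model: a fixed distribution over three outputs.

    A real model would condition on c and z; this toy ignores them.
    """
    table = {"Paris": 0.7, "Lyon": 0.2, "Nice": 0.1}
    return math.log(table[y])

c = "France's capital is Paris."       # full context (made up for this example)
z = "What is the capital of France?"   # input (made up for this example)
y_hat = predict(toy_log_prob, ["Paris", "Lyon", "Nice"], c, z)
print(y_hat)  # prints "Paris"
```

Working in log probabilities does not change the arg max, since the logarithm is monotonic, and it avoids numerical underflow when probabilities of long sequences are multiplied.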

Updated 2026-05-02

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences