LLM Prediction with Full Context
The prediction of a Large Language Model, denoted as ŷ, when provided with a full context c and an input x, is determined by selecting the output that maximizes the conditional probability. This process is formally expressed by the formula: ŷ = argmax_y Pr(y | c, x). This prediction often serves as the target or 'gold standard' when learning a compressed representation of the context.
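The argmax decision rule above can be sketched in a few lines. This is a minimal illustration, not a real LLM: the function name `predict` and the candidate outputs with their probabilities are made up for the example, standing in for Pr(y | c, x) over a candidate set.

```python
def predict(cond_probs):
    """Return the output y with the highest Pr(y | c, x).

    cond_probs: dict mapping each candidate output y to its
    conditional probability given the context c and input x.
    """
    return max(cond_probs, key=cond_probs.get)

# Illustrative (made-up) probabilities for three candidate outputs:
probs = {"y1": 0.2, "y2": 0.7, "y3": 0.1}
print(predict(probs))  # -> y2
```

Note that only the ordering of the probabilities matters for the prediction; the same rule applies whether the scores are probabilities or unnormalized log-probabilities.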
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
LLM Prediction with Full Context
LLM Prediction with Compressed Context
Mathematical Formulation of Prompt Ensembling
Formula for Scoring Reasoning Paths by Counting Correct Steps
A classification model is given an input, x, and must choose an output, y, from the set of possible classes {A, B, C, D}. The model's decision rule is to select the class that has the highest conditional probability, Pr(y|x). Given the following probabilities calculated by the model for the input x, what will its final prediction be?
Pr(y=A | x) = 0.15
Pr(y=B | x) = 0.55
Pr(y=C | x) = 0.25
Pr(y=D | x) = 0.05
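As a quick check, the argmax decision rule can be applied directly to the probabilities listed in the question (a minimal Python sketch; the values are copied from the card above):

```python
# Pr(y | x) for each class, as given in the question:
probs = {"A": 0.15, "B": 0.55, "C": 0.25, "D": 0.05}

# The decision rule: pick the class with the highest conditional probability.
prediction = max(probs, key=probs.get)
print(prediction)  # -> B
```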
Model Prediction vs. Ground Truth
Analyzing a Model's Prediction Choice
Learn After
Formula for Optimizing Soft Prompts via Context Compression
Formula for Soft Prompt Optimization via Log-Likelihood Maximization
Formula for Soft Prompt Optimization by Minimizing KL Divergence
An inference engine using a continuous batching strategy is currently processing a set of text generation requests that fully utilizes its processing capacity. At this point, a new, additional request arrives. What is the most likely immediate action the system's scheduler will take regarding this new request?
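The scheduling behavior the question asks about can be sketched as a toy queue: under continuous batching, a request arriving while the batch is at capacity is typically placed in a waiting queue and admitted as soon as an in-flight request completes. The class and method names below (`Scheduler`, `submit`, `finish`) are hypothetical, chosen only for this illustration.

```python
from collections import deque

class Scheduler:
    """Toy continuous-batching scheduler: full batch queues new arrivals."""

    def __init__(self, capacity):
        self.capacity = capacity      # max requests processed at once
        self.running = []             # requests currently in the batch
        self.waiting = deque()        # FIFO queue of pending requests

    def submit(self, req):
        if len(self.running) < self.capacity:
            self.running.append(req)  # free slot: start immediately
        else:
            self.waiting.append(req)  # batch full: queue, do not reject

    def finish(self, req):
        self.running.remove(req)      # a request completes...
        if self.waiting:              # ...so the next queued request is admitted
            self.running.append(self.waiting.popleft())

# Usage: capacity 2, three requests; "c" waits until "a" finishes.
s = Scheduler(capacity=2)
s.submit("a"); s.submit("b"); s.submit("c")
print(list(s.waiting))  # -> ['c']
s.finish("a")
print(s.running)        # -> ['b', 'c']
```

Real engines refine this with per-token scheduling and memory-based admission, but the immediate action on arrival at full capacity is the same: the request waits rather than being dropped.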
A language model is provided with a context c ('Translate the following sentence for a medical professional') and an input z ('Le patient présente une pyrexie'). The model computes the conditional probabilities for several potential English translations (y). Based on the principle of selecting the output that maximizes the conditional probability given the full context and input, which translation should the model choose as its prediction?
Analyzing Contextual Influence on LLM Predictions