Formula

Predictive Inference in Large Language Models

In the context of a Large Language Model (LLM), predictive inference selects an output $\hat{y}$ by maximizing the model's probability distribution. Formally, this combines the general prediction rule $\hat{y} = \arg\max_{y} \Pr(y \mid x)$ with the model's specific probability function $\Pr^{s}_{\theta}(\cdot)$, giving $\hat{y} = \arg\max_{y} \Pr^{s}_{\theta}(y \mid x)$. Here $\theta$ represents the model's parameters, and the superscript $s$ may refer to a specific scoring method.
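The argmax rule above can be sketched in a few lines of Python. This is a minimal illustration, not a real LLM: the `log_prob` table of conditional log-probabilities is an invented stand-in for $\Pr^{s}_{\theta}(y \mid x)$, and the candidate set is assumed to be small and enumerable (in practice the output space is searched approximately, e.g. by greedy or beam decoding).

```python
import math

# Hypothetical stand-in for the model's scoring function Pr_theta^s(y | x).
# A real LLM would compute these values from its parameters theta; here we
# hard-code a tiny table of log-probabilities purely for illustration.
def log_prob(y: str, x: str) -> float:
    table = {
        ("Paris", "The capital of France is"): math.log(0.92),
        ("Lyon", "The capital of France is"): math.log(0.05),
        ("Berlin", "The capital of France is"): math.log(0.03),
    }
    return table.get((y, x), float("-inf"))

def predict(x: str, candidates: list[str]) -> str:
    # y_hat = argmax_y Pr_theta^s(y | x), computed in log space
    # (maximizing the log-probability maximizes the probability).
    return max(candidates, key=lambda y: log_prob(y, x))

y_hat = predict("The capital of France is", ["Paris", "Lyon", "Berlin"])
print(y_hat)  # Paris
```

Working in log space is the usual design choice: products of many small token probabilities underflow, whereas sums of log-probabilities remain numerically stable.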


Updated 2025-10-08


Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences