LLM Prediction with Compressed Context
The prediction of a Large Language Model, denoted as ŷ_σ, when using a soft prompt σ (a compressed context) and an input z, is determined by selecting the output that maximizes the conditional probability. This is formally expressed as:

ŷ_σ = argmax_y Pr(y | σ, z)

This prediction is compared against the prediction from the full context to optimize the soft prompt.
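A minimal sketch of the selection rule, assuming the LLM exposes a conditional distribution Pr(y | σ, z) over a small set of candidate outputs (the candidates and probability values here are hypothetical):

```python
def predict(cond_probs):
    """Return the candidate y with the highest conditional probability Pr(y | sigma, z)."""
    return max(cond_probs, key=cond_probs.get)

# Hypothetical conditional probabilities Pr(y | sigma, z) for four candidate outputs.
cond_probs = {"mat": 0.65, "roof": 0.25, "sky": 0.05, "idea": 0.05}

y_hat_sigma = predict(cond_probs)  # the argmax over candidates, here "mat"
```

In practice the candidate set is the model's output vocabulary (or sequences over it), but the argmax principle is the same.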
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
LLM Prediction with Full Context
LLM Prediction with Compressed Context
Mathematical Formulation of Prompt Ensembling
Formula for Scoring Reasoning Paths by Counting Correct Steps
A classification model is given an input, x, and must choose an output, y, from the set of possible classes {A, B, C, D}. The model's decision rule is to select the class that has the highest conditional probability, Pr(y|x). Given the following probabilities calculated by the model for the input x, what will its final prediction be?
- Pr(y=A | x) = 0.15
- Pr(y=B | x) = 0.55
- Pr(y=C | x) = 0.25
- Pr(y=D | x) = 0.05
Model Prediction vs. Ground Truth
Analyzing a Model's Prediction Choice
Learn After
Formula for Optimizing Soft Prompts via Context Compression
Formula for Soft Prompt Optimization by Minimizing KL Divergence
An LLM is provided with a compressed representation of context, denoted as σ, and an input z. The model's goal is to predict the most likely output y. After processing σ and z, the model computes the following conditional probabilities for four possible outputs:
- Pr(y='mat' | σ, z) = 0.65
- Pr(y='roof' | σ, z) = 0.25
- Pr(y='sky' | σ, z) = 0.05
- Pr(y='idea' | σ, z) = 0.05
Based on the principle of selecting the output that maximizes the conditional probability, what will the model's final prediction, ŷ_σ, be?
Deconstructing the LLM Prediction Formula
Analyzing an LLM's Incorrect Prediction