Formula for Soft Prompt Optimization by Minimizing KL Divergence
An alternative approach to optimizing soft prompts involves minimizing the Kullback-Leibler (KL) divergence between the output probability distribution obtained from the full context, $\Pr(\cdot \mid c, z)$, and the distribution obtained from the soft prompt, $\Pr(\cdot \mid \sigma, z)$. The goal is to find the soft prompt that makes these two distributions as similar as possible. The optimization is expressed by the formula:

$$\hat{\sigma} = \arg\min_{\sigma} \mathrm{KL}\big(\Pr(\cdot \mid c, z) \,\|\, \Pr(\cdot \mid \sigma, z)\big)$$
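For intuition, here is a minimal, self-contained PyTorch sketch of this objective. The tiny stand-in model, the mean-pooling step, the soft-prompt length of 8, and all shapes are illustrative assumptions, not the implementation described here; with a real LLM, both distributions would come from the frozen model, with the soft prompt fed in as input embeddings.

```python
# Sketch: learn a soft prompt sigma so that the next-token distribution given
# [sigma; z] matches the distribution given the full context [c; z], by
# minimizing KL(Pr(. | c, z) || Pr(. | sigma, z)). Toy model, illustrative only.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
vocab_size, d_model = 100, 32

# Stand-in "language model": token embeddings + a pooled hidden state
# projected to next-token logits. A real LLM would replace this.
embed = nn.Embedding(vocab_size, d_model)
lm_head = nn.Linear(d_model, vocab_size)
for p in list(embed.parameters()) + list(lm_head.parameters()):
    p.requires_grad_(False)  # the language model itself stays frozen

def next_token_logits(prefix_embeds: torch.Tensor) -> torch.Tensor:
    """Map a (seq_len, d_model) sequence of input embeddings to next-token logits."""
    hidden = prefix_embeds.mean(dim=0)  # crude pooling standing in for attention
    return lm_head(hidden)

c = torch.randint(0, vocab_size, (50,))  # full (long) context, as token ids
z = torch.randint(0, vocab_size, (5,))   # task input, as token ids

# Target distribution Pr(. | c, z), computed once and held fixed.
with torch.no_grad():
    p_full = F.softmax(next_token_logits(embed(torch.cat([c, z]))), dim=-1)

# Soft prompt sigma: a short trainable block of embeddings that replaces c.
sigma = nn.Parameter(torch.randn(8, d_model) * 0.02)
optimizer = torch.optim.Adam([sigma], lr=1e-2)

for step in range(200):
    log_q = F.log_softmax(next_token_logits(torch.cat([sigma, embed(z)])), dim=-1)
    # KL(P || Q): P = full-context distribution, Q = soft-prompt distribution.
    loss = F.kl_div(log_q, p_full, reduction="sum")
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

print(f"final KL(P || Q): {loss.item():.4f}")
```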

Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Formula for Optimizing Soft Prompts via Context Compression
Formula for Soft Prompt Optimization via Log-Likelihood Maximization
Formula for Soft Prompt Optimization by Minimizing KL Divergence
An inference engine using a continuous batching strategy is currently processing a set of text generation requests that fully utilizes its processing capacity. At this point, a new, additional request arrives. What is the most likely immediate action the system's scheduler will take regarding this new request?
A language model is provided with a context c ('Translate the following sentence for a medical professional') and an input z ('Le patient présente une pyrexie'). The model computes the conditional probabilities for several potential English translations (y). Based on the principle of selecting the output that maximizes the conditional probability given the full context and input, which translation should the model choose as its prediction?
Analyzing Contextual Influence on LLM Predictions
Formula for Optimizing Soft Prompts via Context Compression
Formula for Soft Prompt Optimization by Minimizing KL Divergence
An LLM is provided with a compressed representation of context, denoted as σ, and an input z. The model's goal is to predict the most likely output y. After processing σ and z, the model computes the following conditional probabilities for four possible outputs:
- Pr(y='mat' | σ, z) = 0.65
- Pr(y='roof' | σ, z) = 0.25
- Pr(y='sky' | σ, z) = 0.05
- Pr(y='idea' | σ, z) = 0.05
Based on the principle of selecting the output that maximizes the conditional probability, what will the model's final prediction, ŷ_σ, be?
Deconstructing the LLM Prediction Formula
Analyzing an LLM's Incorrect Prediction
Formula for Soft Prompt Optimization by Minimizing KL Divergence
Derivation of the KL Divergence Objective for Policy Optimization
A machine learning model produces a probability distribution Q over a set of outcomes, aiming to approximate a true data distribution P. During evaluation, you observe that the divergence measure is low, while the reverse measure is high. Based on these results, what is the most likely characteristic of the model's distribution Q?
Calculating Divergence Between Distributions
Choosing a Loss Function for Model Distillation
Formula for Soft Prompt Optimization via Log-Likelihood Maximization
Formula for Soft Prompt Optimization by Minimizing KL Divergence
A team is creating a soft prompt to summarize a complex user manual for a question-answering model. Their main objective is not just to get the single correct answer, but to ensure the model's uncertainty and its ranking of other plausible-but-incorrect answers are the same with the soft prompt as they were with the full manual. Which of the following optimization strategies best aligns with this specific objective?
Choosing an Optimization Strategy for Soft Prompts
A researcher is optimizing a soft prompt. With the original, long context, the model predicts the correct answer with 60% probability and a plausible alternative with 30% probability. The researcher's goal is to create a soft prompt that causes the model to predict the correct answer with over 95% probability, even if this significantly changes the probability of the alternative answer. Which optimization approach is better suited for this specific goal?
Learn After
A researcher is training a soft prompt, denoted as σ, to mimic the behavior of a full context, c, for a given input, z. They use the Kullback-Leibler (KL) divergence between the model's output probability distributions as their objective function: KL(Pr(· | c, z) ‖ Pr(· | σ, z)). After extensive training, the researcher observes that the KL divergence has reached a value of 0. What is the most accurate conclusion to draw from this result?
Evaluating Soft Prompt Performance
Analyzing the Asymmetry in Soft Prompt Optimization
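The asymmetry referenced in the last two questions comes from the fact that KL divergence is not symmetric: KL(P ‖ Q) and KL(Q ‖ P) penalize different kinds of mismatch. The short, self-contained sketch below (with made-up probabilities loosely echoing the 60%/30% scenario above) makes this concrete.

```python
# Toy illustration of the KL asymmetry: the same pair of distributions gives
# a large divergence in one direction and a smaller one in the other.
import math

def kl(p, q):
    """KL(P || Q) = sum_y P(y) * log(P(y) / Q(y)); assumes Q(y) > 0 wherever P(y) > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# P: the full-context distribution spreads mass over two plausible answers.
# Q: a soft-prompt distribution that collapses almost all mass onto one answer.
P = [0.60, 0.30, 0.05, 0.05]
Q = [0.97, 0.01, 0.01, 0.01]

print(f"KL(P || Q) = {kl(P, Q):.3f}")  # ~0.89: Q ignores mass P puts on the runner-up
print(f"KL(Q || P) = {kl(Q, P):.3f}")  # ~0.40: wherever Q puts mass, P has some too
```

In this setting, minimizing KL(P ‖ Q) pushes the soft-prompt distribution Q to cover everything the full-context distribution P considers plausible, whereas maximizing the log-likelihood of a single correct answer only rewards concentrating probability mass on that answer.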