Formula for Soft Prompt Optimization via Log-Likelihood Maximization
The optimal soft prompt, denoted as $\hat{\sigma}$, can be found by maximizing the log-probability of the target prediction $\hat{y}$ (derived from the full context). This optimization is conditioned on the soft prompt $\sigma$ and the original input $z$. The formula is expressed as:

$$\hat{\sigma} = \underset{\sigma}{\arg\max}\; \log \Pr(\hat{y} \mid \sigma, z)$$

where $\hat{y} = \underset{y}{\arg\max}\; \Pr(y \mid c, z)$ is the prediction obtained when the model is conditioned on the full context $c$ and the input $z$. This approach frames the optimization problem as a maximum likelihood estimation task, where the goal is to find the prompt that makes the desired output most probable.
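As a minimal sketch of this objective (not from the source), the PyTorch code below treats $\sigma$ as a trainable block of continuous "virtual token" embeddings and runs gradient ascent on $\log \Pr(\hat{y} \mid \sigma, z)$. The model interface (`inputs_embeds`, `get_input_embeddings`) assumes a Hugging Face-style decoder-only LM; all names, shapes, and hyperparameters are illustrative assumptions.

```python
import torch

def optimize_soft_prompt(model, z_embeds, y_hat_ids, prompt_len=16, steps=200, lr=1e-2):
    """Sketch: find sigma that maximizes log Pr(y_hat | sigma, z).

    model      -- decoder-only LM accepting inputs_embeds (Hugging Face-style; assumed)
    z_embeds   -- embedded input z, shape (1, len_z, d_model)
    y_hat_ids  -- token ids of the target prediction y_hat, shape (1, len_y)
    """
    model.requires_grad_(False)  # freeze the LM; only sigma is learned
    d_model = z_embeds.size(-1)
    # sigma: trainable sequence of continuous prompt embeddings
    sigma = torch.randn(1, prompt_len, d_model, requires_grad=True)
    optimizer = torch.optim.Adam([sigma], lr=lr)
    y_hat_embeds = model.get_input_embeddings()(y_hat_ids)

    for _ in range(steps):
        optimizer.zero_grad()
        # Condition the model on [sigma; z; y_hat] and score the y_hat tokens.
        inputs = torch.cat([sigma, z_embeds, y_hat_embeds], dim=1)
        logits = model(inputs_embeds=inputs).logits
        start = prompt_len + z_embeds.size(1)
        # Each y_hat token is predicted from the position immediately before it.
        pred_logits = logits[:, start - 1 : start - 1 + y_hat_ids.size(1), :]
        log_probs = torch.log_softmax(pred_logits, dim=-1)
        nll = -log_probs.gather(-1, y_hat_ids.unsqueeze(-1)).sum()
        nll.backward()  # maximizing log-likelihood = minimizing the negative log-likelihood
        optimizer.step()

    return sigma.detach()
```

The returned $\hat{\sigma}$ can then stand in for the full context $c$ at inference time, with only the (much shorter) soft prompt and the input $z$ fed to the model.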
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Formula for Optimizing Soft Prompts via Context Compression
Formula for Soft Prompt Optimization via Log-Likelihood Maximization
Formula for Soft Prompt Optimization by Minimizing KL Divergence
An inference engine using a continuous batching strategy is currently processing a set of text generation requests that fully utilizes its processing capacity. At this point, a new, additional request arrives. What is the most likely immediate action the system's scheduler will take regarding this new request?
A language model is provided with a context c ('Translate the following sentence for a medical professional') and an input z ('Le patient présente une pyrexie'). The model computes the conditional probabilities for several potential English translations (y). Based on the principle of selecting the output that maximizes the conditional probability given the full context and input, which translation should the model choose as its prediction?
Analyzing Contextual Influence on LLM Predictions
Formula for Soft Prompt Optimization via Log-Likelihood Maximization
Formula for Soft Prompt Optimization by Minimizing KL Divergence
A team is creating a soft prompt to summarize a complex user manual for a question-answering model. Their main objective is not just to get the single correct answer, but to ensure the model's uncertainty and its ranking of other plausible-but-incorrect answers are the same with the soft prompt as they were with the full manual. Which of the following optimization strategies best aligns with this specific objective?
Choosing an Optimization Strategy for Soft Prompts
A researcher is optimizing a soft prompt. With the original, long context, the model predicts the correct answer with 60% probability and a plausible alternative with 30% probability. The researcher's goal is to create a soft prompt that causes the model to predict the correct answer with over 95% probability, even if this significantly changes the probability of the alternative answer. Which optimization approach is better suited for this specific goal?
Learn After
A research team is developing a system to answer questions based on a large document. Instead of feeding the entire document into a language model for every question, they want to learn a compressed, continuous representation of the document (a 'soft prompt', σ). Their process is as follows:
- First, for a given question (z), they run the model with the full document to get a high-quality, 'gold standard' answer (ŷ).
- Next, they try to find the optimal soft prompt (σ) that, when paired with the original question (z), causes the model to produce that same 'gold standard' answer (ŷ).
They define the 'optimal' soft prompt as the one that makes the probability of generating the 'gold standard' answer as high as possible. Based on this optimization strategy, which statement best describes the primary goal?
Interpreting the Soft Prompt Optimization Formula
A team is training a soft prompt (σ) to help a language model generate a specific, high-quality target sentence (ŷ) when given an input (z). They are considering two different optimization objectives:
- Objective 1: Adjust the soft prompt σ to maximize the probability of the model generating the exact target sentence ŷ.
- Objective 2: Adjust the soft prompt σ so that the model's entire probability distribution over the next possible word matches the distribution it would have had if it were conditioned on the full, original context instead of the prompt.
Which statement best evaluates the fundamental difference in what these two objectives are trying to achieve? (The two objectives are contrasted as loss functions in the sketch below.)
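A minimal sketch (not from the source) of that contrast, assuming next-token logits are available from the model both under the soft prompt and under the full original context; all function names and shapes here are illustrative assumptions.

```python
import torch.nn.functional as F

# Hypothetical shapes: logits are (batch, vocab_size), y_hat_id is (batch,).
def objective_1_loss(sigma_logits, y_hat_id):
    """Objective 1: maximize log Pr(y_hat | sigma, z).

    Only the probability assigned to the single target y_hat matters;
    how the remaining mass is spread over other candidates is ignored.
    """
    return F.cross_entropy(sigma_logits, y_hat_id)

def objective_2_loss(sigma_logits, full_context_logits):
    """Objective 2: minimize KL(Pr(. | c, z) || Pr(. | sigma, z)).

    The soft-prompted distribution must match the full-context distribution
    everywhere, preserving the model's uncertainty and its ranking of
    plausible-but-incorrect alternatives.
    """
    return F.kl_div(
        F.log_softmax(sigma_logits, dim=-1),     # log Pr(. | sigma, z)
        F.softmax(full_context_logits, dim=-1),  # Pr(. | c, z)
        reduction="batchmean",
    )
```

In short, Objective 1 only rewards probability mass on the single target $\hat{y}$, while Objective 2 asks the soft-prompted distribution to reproduce the full-context distribution as a whole.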