Learn Before
Greedy Search with Penalty Objective
In a greedy search algorithm, a decoding objective with a penalty term can be incorporated by evaluating candidate tokens and keeping only the single sequence that maximizes the penalized objective, Pr(y|x) - λ * Penalty(x, y), at each individual decoding step.
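As a minimal sketch of this idea: the toy model, the repetition-based penalty function, and the λ values below are all invented for illustration (they are not from the source), but the decoding loop itself follows the penalized objective Pr(y|x) - λ * Penalty(x, y), keeping only the single best-scoring token at each step.

```python
# Toy "model": fixed next-token log-probabilities, invented for illustration.
def toy_log_probs(prefix):
    scores = {"the": -1.2, "cat": -1.6, "sat": -1.9, "mat": -2.1}
    if prefix:
        # Make the toy model strongly prefer repeating its last token,
        # which the penalty term should counteract.
        scores = dict(scores)
        scores[prefix[-1]] = -0.5
    return scores

def repetition_penalty(prefix, token):
    # Hypothetical penalty: grows with how often `token` already appears.
    return prefix.count(token)

def greedy_step(prefix, lam):
    # Keep only the single candidate maximizing log Pr - lam * Penalty.
    scored = {
        tok: lp - lam * repetition_penalty(prefix, tok)
        for tok, lp in toy_log_probs(prefix).items()
    }
    return max(scored, key=scored.get)

def greedy_decode(steps, lam):
    out = []
    for _ in range(steps):
        out.append(greedy_step(out, lam))
    return out

print(greedy_decode(4, lam=0.0))  # no penalty: the model repeats itself
print(greedy_decode(4, lam=2.0))  # penalty active: repetition is suppressed
```

With λ = 0 the loop reduces to plain greedy search on the model's probabilities, so the repetition bias wins; with a large enough λ the penalty outweighs the repeated token's score and the output diversifies.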
Tags
Foundations of Large Language Models
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Penalty Function in Controllable Decoding
A developer is using a language model for text summarization. The model's outputs are generally fluent but suffer from excessive repetition of certain phrases. To address this, the developer employs a decoding objective that penalizes repetition, formulated as:
argmax [Pr(y|x) - λ * Penalty(x, y)], where Penalty(x, y) increases with the amount of repetition in the output y. How should the developer adjust the hyperparameter λ to make the summaries less repetitive?
Analyzing the Trade-off in Penalized Decoding
Consider the decoding objective for controllable text generation:
ŷ = argmax [Pr(y|x) - λ * Penalty(x, y)]. If the hyperparameter λ is set to 0, the objective simplifies to finding the output with the highest conditional probability, effectively ignoring any penalty.
Greedy Search with Penalty Objective
Sampling-based Search with Penalty Objective