Concept

Greedy Search with Penalty Objective

In a greedy search algorithm, a decoding objective with a penalty term can be incorporated by evaluating candidate tokens and keeping only the single sequence that maximizes the penalized objective, Pr(yx)λPenalty(x,y)\Pr(\mathbf{y}|\mathbf{x}) - \lambda \cdot \mathrm{Penalty}(\mathbf{x},\mathbf{y}), at each individual decoding step.

0

1

Updated 2026-05-05

Contributors are:

Who are from:

Tags

Foundations of Large Language Models

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences