Learn Before
Consider the decoding objective for controllable text generation: ŷ = argmax_y [Pr(y|x) - λ * Penalty(x, y)], where the maximization is over candidate outputs y. If the hyperparameter λ is set to 0, the objective reduces to finding the output with the highest conditional probability, effectively ignoring the penalty term.
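For concreteness, here is a minimal Python sketch of this objective over a toy candidate set. The function name select_output, the candidate strings, probabilities, and penalty values are all made up for illustration; they are not outputs of any real model.

```python
# Sketch of the penalized decoding objective
#   y_hat = argmax_y [ Pr(y|x) - lambda * Penalty(x, y) ]
# evaluated over an explicit list of candidates. All numbers are illustrative.

def select_output(candidates, lam):
    """Return the candidate maximizing Pr(y|x) - lam * Penalty(x, y)."""
    return max(candidates, key=lambda c: c["prob"] - lam * c["penalty"])

candidates = [
    {"text": "the cat sat on the mat",         "prob": 0.42, "penalty": 0.0},
    # Repetitive candidate that the model happens to rate slightly more probable.
    {"text": "the cat the cat sat on the mat", "prob": 0.45, "penalty": 0.6},
]

# With lam = 0 the penalty term vanishes, so the highest-probability
# (here, repetitive) output is selected.
print(select_output(candidates, lam=0.0)["text"])

# With lam > 0 the penalty is active and the less repetitive candidate wins.
print(select_output(candidates, lam=0.5)["text"])
```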
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Penalty Function in Controllable Decoding
A developer is using a language model for text summarization. The model's outputs are generally fluent but suffer from excessive repetition of certain phrases. To address this, the developer employs a decoding objective that penalizes repetition, formulated as:
argmax [Pr(y|x) - λ * Penalty(x, y)], where Penalty(x, y) increases with the amount of repetition in the output y. How should the developer adjust the hyperparameter λ to make the summaries less repetitive? (A minimal sketch of such a penalty appears after this list.)
Analyzing the Trade-off in Penalized Decoding
Greedy Search with Penalty Objective
Sampling-based Search with Penalty Objective
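To make the repetition-penalty question above concrete, here is a rough Python sketch in which Penalty(x, y) is taken to count repeated bigrams in y. This particular penalty definition, the helper names repetition_penalty and decode, and the candidate probabilities are all assumptions made for the example, not part of the original formulation.

```python
from collections import Counter

# Sketch of the repetition-penalized objective
#   argmax_y [ Pr(y|x) - lambda * Penalty(x, y) ]
# where Penalty(x, y) is assumed, for illustration, to count repeated bigrams in y.

def repetition_penalty(y):
    """Count bigram occurrences in y that repeat an earlier bigram."""
    tokens = y.split()
    counts = Counter(zip(tokens, tokens[1:]))
    return sum(c - 1 for c in counts.values())

def decode(candidates, lam):
    """Pick the candidate maximizing Pr(y|x) - lam * repetition_penalty(y)."""
    return max(candidates, key=lambda c: c["prob"] - lam * repetition_penalty(c["text"]))

# Toy candidate summaries with made-up probabilities; the repetitive one is
# slightly more probable, mimicking the failure mode described in the question.
candidates = [
    {"text": "sales rose sharply in the third quarter", "prob": 0.30},
    {"text": "sales rose sharply sales rose sharply in the third quarter", "prob": 0.35},
]

# Raising lambda makes repetition more costly, so the argmax shifts to the
# less repetitive summary: increasing lambda is the adjustment that reduces repetition.
for lam in (0.0, 0.05):
    print(lam, "->", decode(candidates, lam)["text"])
```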