Greedy Search (Greedy Decoding)
Greedy search, also known as greedy decoding, is one of the most widely used decoding algorithms in natural language processing tasks such as machine translation. The idea is simple: make a locally optimal decision at each generation step by selecting the next token with the highest conditional probability. The total log-probability of the resulting sequence is just the sum of these per-step conditional log-probabilities. Note that greedy search never compares alternative sequences by their overall log-probability; because it commits to one token per step, it can miss a sequence that is globally more probable.
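The step-by-step selection described above can be sketched in a few lines of Python. The toy "model" below (a lookup table of next-token distributions), its vocabulary, and its probabilities are illustrative assumptions, not from the text.

```python
import math

# Toy "language model": maps a context tuple to a next-token
# probability distribution (illustrative numbers only).
TOY_MODEL = {
    (): {"The": 0.7, "A": 0.3},
    ("The",): {"quick": 0.5, "slow": 0.3, "lazy": 0.2},
    ("The", "quick"): {"brown": 0.6, "red": 0.4},
    ("The", "quick", "brown"): {"fox": 0.8, "<eos>": 0.2},
    ("The", "quick", "brown", "fox"): {"<eos>": 1.0},
}

def greedy_decode(model, max_len=10):
    """At every step, commit to the single most probable next token."""
    seq, total_logprob = [], 0.0
    for _ in range(max_len):
        dist = model[tuple(seq)]
        token, prob = max(dist.items(), key=lambda kv: kv[1])
        total_logprob += math.log(prob)  # log-probs add across steps
        if token == "<eos>":
            break
        seq.append(token)
    return seq, total_logprob

tokens, lp = greedy_decode(TOY_MODEL)
print(tokens, lp)
```

Note that the running total is updated by simple addition of conditional log-probabilities; this is the same incremental-summation principle used in the exercises below.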

Ch.5 Inference - Foundations of Large Language Models
Related
Beam search
A language model is generating a sequence of tokens. The total log-probability for the partially generated sequence 'The quick brown' has been calculated as -3.5. In the very next step, the model computes the conditional log-probability for the token 'fox' as -1.2. What is the new total log-probability for the complete sequence 'The quick brown fox'?
A language model is generating a sequence. The table below shows the conditional log-probability for each new token and the claimed total accumulated log-probability for the sequence up to that point. Analyze the table to identify the first step where the total accumulated log-probability is calculated incorrectly based on the principle of incremental summation.
Step  Token   Conditional log-prob  Total Accumulated log-prob
1     'The'   -0.9                  -0.9
2     'cat'   -1.5                  -2.4
3     'sat'   -1.1                  -2.6

Comparing Generation Paths
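Both accumulation questions above rest on one rule: the total log-probability after a step equals the previous total plus the new token's conditional log-probability (for the first question, -3.5 + (-1.2) = -4.7). A minimal sketch that checks the table against this rule, using the numbers from the table itself:

```python
# (token, conditional log-prob, claimed running total) from the table above.
steps = [
    ("The", -0.9, -0.9),
    ("cat", -1.5, -2.4),
    ("sat", -1.1, -2.6),
]

running = 0.0
first_error = None
for i, (token, cond_lp, claimed_total) in enumerate(steps, start=1):
    running += cond_lp  # correct total: previous total + conditional log-prob
    if first_error is None and abs(running - claimed_total) > 1e-9:
        first_error = i

print(first_error)  # first step whose claimed total breaks the summation rule
```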
A company is developing two applications using a language model. Application A is a tool for generating formal, standardized financial reports where it is critical that the same input data always produces the exact same summary. Application B is a creative writing assistant designed to help authors brainstorm diverse plot ideas. Which application is a more suitable use case for a deterministic decoding algorithm, and why?
Chatbot Performance Analysis
Evaluating Decoding Strategies for Conversational AI
Formula for Pruned Step-wise Expansion of the Hypothesis Set
A language model is generating a sentence and must decide on the next word. It has identified 100 possible words, each with an associated probability. To manage computational resources, the model employs a strategy that discards all but the top 5 most probable words before considering the subsequent step. Which of the following statements best analyzes the primary trade-off inherent in this strategy?
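The pruning strategy in the question above can be sketched as follows. The 100 candidate words and their probabilities are randomly generated placeholders (not from the text), and k = 5 matches the scenario: everything outside the top k is discarded before the next step, trading completeness for compute and memory.

```python
import heapq
import random

random.seed(0)
# 100 hypothetical candidate words with placeholder probabilities.
candidates = {f"word_{i}": random.random() for i in range(100)}

k = 5
# Keep only the k most probable candidates before expanding the next step;
# all other hypotheses are pruned and can never be recovered later.
pruned = heapq.nlargest(k, candidates.items(), key=lambda kv: kv[1])

print(len(pruned))
```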
Analyzing Text Generation System Performance
Rationale for Decoding Heuristics
Learn After
Mathematical Justification for Greedy Search
Construction of the Optimal Sequence in Greedy Search
Candidate Set in Greedy Search
A language model is generating a two-token sequence. At the first step, it calculates the probability for the next token: 'Token A' has a probability of 0.6, and 'Token B' has a probability of 0.4. If the model chooses 'Token A', the most probable subsequent token is 'Token C' (with a conditional probability of 0.5). If the model had chosen 'Token B', the most probable subsequent token would be 'Token D' (with a conditional probability of 0.9). A text generation algorithm is used that, at every step, commits to the single token with the highest immediate probability. Based on this process, which sequence will be generated and why?
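The scenario above can be traced directly in code, using only the probabilities given in the question:

```python
# Step-one probabilities and the best two-token paths from the scenario.
p = {
    ("A",): 0.6, ("B",): 0.4,
    ("A", "C"): 0.6 * 0.5,  # A, then its best continuation C
    ("B", "D"): 0.4 * 0.9,  # B, then its best continuation D
}

# The algorithm commits to the highest immediate probability at step one,
# so it can never reach the B -> D path, even though that path has the
# higher overall probability.
first = "A" if p[("A",)] > p[("B",)] else "B"
greedy_path = ("A", "C") if first == "A" else ("B", "D")

print(greedy_path, p[greedy_path], p[("B", "D")])
```

Multiplying along each path shows why this is a suboptimal outcome: the greedy path has probability 0.6 x 0.5 = 0.30, while the forgone path has 0.4 x 0.9 = 0.36.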
Algorithm Suitability for Text Generation Tasks
When generating a sequence of text, an algorithm that selects the single most probable token at each step is guaranteed to produce the overall most probable sequence.
Analyzing Suboptimal Outcomes in Text Generation
Selecting and Justifying a Decoding Policy for Two Production Use Cases
Debugging Decoding: Balancing Determinism, Diversity, and Length in a Regulated Product
Post-incident analysis: fixing repetition and truncation by tuning decoding
Choosing a Decoding Configuration Under Latency, Diversity, and Length Constraints
Release-readiness decision: decoding configuration for a customer-facing summarization feature
Decoding policy decision for a multilingual support assistant under safety, latency, and verbosity constraints