Concept

Greedy Search (Greedy Decoding)

Greedy search, also known as greedy decoding, is one of the most widely used decoding algorithms in natural language processing tasks such as machine translation. The idea is to make a locally optimal decision at each generation step by selecting the next token with the highest predicted probability. At step $i$, the process evaluates candidate sequences $\mathbf{y} = y_1 \ldots y_i \in Y_{i-1} \times V$ by their overall log-probability $\log \Pr(\mathbf{y}|\mathbf{x})$ and keeps only the single most likely one.
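
The selection rule described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the lookup table `TOY_MODEL`, the helper `next_token_probs`, and the function `greedy_decode` are hypothetical names invented for this example, and the table stands in for a real model's next-token distribution.

```python
import math

# Hypothetical toy "model": maps a prefix (tuple of tokens) to a
# next-token probability distribution. A real system would query a
# neural network here instead of a lookup table.
TOY_MODEL = {
    (): {"the": 0.6, "a": 0.4},
    ("the",): {"cat": 0.5, "dog": 0.3, "</s>": 0.2},
    ("the", "cat"): {"sat": 0.7, "</s>": 0.3},
    ("the", "cat", "sat"): {"</s>": 1.0},
}


def next_token_probs(prefix):
    """Return the next-token distribution for a prefix (assumed interface)."""
    # Unknown prefixes fall back to ending the sequence.
    return TOY_MODEL.get(tuple(prefix), {"</s>": 1.0})


def greedy_decode(next_probs, eos="</s>", max_len=10):
    """Pick the most probable token at every step until EOS or max_len."""
    tokens = []
    log_prob = 0.0  # running log Pr(y | x) of the generated sequence
    for _ in range(max_len):
        probs = next_probs(tokens)
        # Locally optimal choice: the single highest-probability token.
        token = max(probs, key=probs.get)
        log_prob += math.log(probs[token])
        tokens.append(token)
        if token == eos:
            break
    return tokens, log_prob


tokens, log_prob = greedy_decode(next_token_probs)
print(tokens, log_prob)
```

Because each step commits to one token and never revisits it, the returned sequence is not guaranteed to have the highest overall log-probability among all possible sequences.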

Updated 2026-05-14

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

D2L

Dive into Deep Learning @ D2L
