Activity (Process)

Pruning and K-Best Output in Beam Search

After the ranking stage of beam search, candidates are selected according to the beam width, denoted K. The K candidates with the highest probabilities are kept as the output, and all lower-probability candidates are discarded, or 'pruned'. For instance, with a beam width of K=3, the top three candidates ('cute', 'on', 'sick') form the output, and the remaining candidates ('are', '.') are pruned.
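The pruning step can be illustrated with a minimal Python sketch. The candidate tokens are the ones from the example above, but the probability values and the function name `prune_to_beam` are hypothetical, chosen only so that the ranking matches the example; the source does not give actual scores.

```python
def prune_to_beam(candidates, k):
    """Split scored candidates into the K best (kept) and the rest (pruned).

    `candidates` maps each candidate token to its probability.
    """
    # Rank candidates from highest to lowest probability.
    ranked = sorted(candidates.items(), key=lambda item: item[1], reverse=True)
    # Keep the first K entries; everything after them is pruned.
    return ranked[:k], ranked[k:]


# Hypothetical probabilities, assumed for illustration only.
candidates = {
    "are": 0.12,
    "cute": 0.35,
    ".": 0.08,
    "sick": 0.20,
    "on": 0.25,
}

kept, pruned = prune_to_beam(candidates, k=3)
kept_tokens = [token for token, _ in kept]
pruned_tokens = [token for token, _ in pruned]
print("kept:", kept_tokens)
print("pruned:", pruned_tokens)
```

With these assumed scores, 'cute', 'on', and 'sick' are kept and 'are' and '.' are pruned. Sorting the full list is the simplest approach; for large vocabularies a partial selection such as `heapq.nlargest` would avoid sorting every candidate.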

Updated 2025-10-10

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences