1Cademy - Pruning and K-Best Output in Beam Search

Learn Before

Ranking Stage in Beam Search

Activity (Process)

Pruning and K-Best Output in Beam Search

After the ranking stage in beam search, candidates are selected according to the beam width, denoted K. The K candidates with the highest probabilities are kept as the output, and all lower-probability candidates are discarded, or 'pruned'. For instance, with a beam width of K=3, the top three candidates ('cute', 'on', 'sick') form the output, and the remaining candidates ('are', '.') are pruned.
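The selection step can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function name `prune_to_beam` is hypothetical, and the probabilities below are made-up placeholders chosen only so that the ranking matches the example above (the source gives the candidate words but not their scores).

```python
def prune_to_beam(candidates, k):
    """Split scored candidates into (kept, pruned) lists.

    candidates: dict mapping each candidate token to its probability.
    k: beam width, i.e. the number of candidates to keep.
    """
    if k < 1:
        raise ValueError("beam width k must be at least 1")
    # Ranking stage: sort candidates from most to least probable.
    ranked = sorted(candidates.items(), key=lambda item: item[1], reverse=True)
    # Pruning stage: the first k survive, everything after them is discarded.
    kept = [token for token, _ in ranked[:k]]
    pruned = [token for token, _ in ranked[k:]]
    return kept, pruned


# Hypothetical probabilities (assumed for illustration; not from the source).
scores = {"are": 0.10, "cute": 0.40, ".": 0.05, "sick": 0.20, "on": 0.25}

kept, pruned = prune_to_beam(scores, k=3)
print("kept:", kept)
print("pruned:", pruned)
```

With K=3 the function keeps 'cute', 'on', and 'sick' and prunes 'are' and '.'. A larger K would let more low-probability candidates survive, and K=1 reduces the step to greedy selection.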

Updated 2025-10-10

References

Reference of Foundations of Large Language Models Course

Learn After

In a text generation process, a set of potential next words and their calculated probabilities are ranked as follows: {'the': 0.45, 'a': 0.25, 'his': 0.15, 'her': 0.10, 'its': 0.05}. If the process uses a fixed width of K=3 to select the most likely candidates, which words are kept and which are discarded (pruned)?
Impact of Search Width on Text Generation
Applying Selection and Pruning in Text Generation
