Formula

Formula for Pruned Step-wise Expansion of the Hypothesis Set

In practical decoding algorithms, the search space consists of an exponentially large number of sequences. To prevent the computational load from growing exponentially with sequence length, strategies are used to prune the space. At each decoding step, the set of candidate sequences YiY_i is formed by applying a pruning function to the full set of expanded hypotheses: Yi=Prune(Yi1×V)Y_i = \mathrm{Prune}(Y_{i-1} \times V). The Prune()\mathrm{Prune}(\cdot) function selectively removes sequences less likely to result in high-quality outcomes. Consequently, the number of sequences under consideration is drastically reduced, ensuring that YiYi1V|Y_i| \ll |Y_{i-1}| \cdot |V|.

Image 0

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences