Formula

Mathematical Formulation of BoN Sampling

The set of NN output sequences, {y^1,...,y^N}\{\hat{\mathbf{y}}_1, ..., \hat{\mathbf{y}}_N\}, in Best-of-NN (BoN) sampling is formally defined as the top NN sequences that maximize the conditional probability Pr(yx)\Pr(\mathbf{y}|\mathbf{x}). This selection is expressed using the argTopN\mathrm{argTopN} operation, which returns the NN outputs that yield the highest values for the function:

{y^1,...,y^N}=argTopNy[Pr(yx)]\{\hat{\mathbf{y}}_1, ..., \hat{\mathbf{y}}_N\} = \mathop{\mathrm{argTopN}}_{\mathbf{y}} \left[ \Pr(\mathbf{y}|\mathbf{x}) \right]

Image 0

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.4 Alignment - Foundations of Large Language Models