Formula

Formula for the Risk of an Output in Minimum Bayes Risk Decoding

The total risk of a candidate output y\mathbf{y} within a set of potential outputs Ω\varOmega is calculated as the expected value of the risk function R(y,yr)R(\mathbf{y}, \mathbf{y}_r) over all other outputs yr\mathbf{y}_r. This expectation is weighted by the conditional probability of each output yr\mathbf{y}_r given the input x\mathbf{x}. The formula is: Risk(y)=EyrPr(yrx)R(y,yr)=yrΩR(y,yr)Pr(yrx)\mathrm{Risk}(\mathbf{y}) = \mathbb{E}_{\mathbf{y}_r \sim \Pr(\mathbf{y}_r|\mathbf{x})} R(\mathbf{y},\mathbf{y}_r) = \sum_{\mathbf{y}_r \in \varOmega} R(\mathbf{y},\mathbf{y}_r) \cdot \Pr(\mathbf{y}_r|\mathbf{x})

Image 0

0

1

Updated 2026-04-30

Contributors are:

Who are from:

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences