Formula

Combined Reward Formula

The combined reward, rcombiner_{\mathrm{combine}}, is calculated by taking a weighted average of the outputs from KK different reward models. Each individual reward model's output, rk(x,y)r_k(\mathbf{x}, \mathbf{y}), is multiplied by a weight wkw_k. These products are summed up over all KK models, and the result is normalized by dividing by KK. The formula is expressed as:

rcombine=1Kk=1Kwkrk(x,y)r_{\mathrm{combine}} = \frac{1}{K} \sum_{k=1}^{K} w_k \cdot r_k(\mathbf{x}, \mathbf{y})

Image 0

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences