Concept

Bayesian Model Averaging for Combining Reward Models

As an alternative to simple weighted averaging, Bayesian model averaging can be used to combine predictions from an ensemble of reward models. This method aggregates the predictions by weighting each model based on its posterior probability, providing a principled way to account for model uncertainty.

0

1

Updated 2026-05-03

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Computing Sciences

Foundations of Large Language Models Course