Definition

Surrogate Objective in Reinforcement Learning

In reinforcement learning, a surrogate objective is an alternative objective function that is optimized in place of the true performance function. A common example is the off-policy objective, which uses importance sampling to evaluate the current policy with data from a reference policy: Eτπθref[Prθ(τ)Prθref(τ)R(τ)]\mathbb{E}_{\tau \sim \pi_{\theta_{\text{ref}}}} \left[ \frac{\text{Pr}_{\theta}(\tau)}{\text{Pr}_{\theta_{\text{ref}}}(\tau)} R(\tau) \right]. This formulation acts as a proxy, or surrogate, for the true on-policy objective Eτπθ[R(τ)]\mathbb{E}_{\tau \sim \pi_{\theta}} [R(\tau)].

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences