Formula

Potential-Based Shaping Function Formula

To ensure that reward shaping does not alter the optimal policy, the shaping reward function ff must be defined as the difference between potential values of successive states. This is known as a potential-based shaping function, given by the formula: f(st,at,st+1)=γΦ(st+1)Φ(st)f(s_t, a_t, s_{t+1}) = \gamma\Phi(s_{t+1}) - \Phi(s_t) Here, Φ\Phi is a real-valued potential function defined over the state space, and γ\gamma is the discount factor. This specific form of ff guarantees that the optimality of the policy is preserved.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences