Learn Before
Value-Based Reward Shaping Formula
This formula presents a specific application of potential-based reward shaping where the state-value function, V(s), is used as the potential function, Φ(s). The transformed reward, r', is calculated by augmenting the original environmental reward, r, with a shaping term derived from the change in the discounted state value between the subsequent state, s_{t+1}, and the current state, s_t. The formula is expressed as:
r' = r + γV(s_{t+1}) - V(s_t)
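The formula can be illustrated with a small sketch. The value-function table and discount factor below are hypothetical values chosen only for demonstration; the potential function Φ is simply the state-value estimate V.

```python
GAMMA = 0.99  # discount factor (assumed value for illustration)

# Hypothetical value estimates V(s) for a few states.
V = {"s0": 1.0, "s1": 2.5, "s2": 4.0}

def shaped_reward(r, s_t, s_next, V, gamma=GAMMA):
    """Apply r' = r + gamma * V(s_{t+1}) - V(s_t)."""
    return r + gamma * V[s_next] - V[s_t]

# A transition toward a higher-valued state receives a positive
# shaping bonus even when the environmental reward is zero.
print(shaped_reward(0.0, "s0", "s1", V))  # 0.99 * 2.5 - 1.0 = 1.475
```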
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Value-Based Reward Shaping Formula
A reinforcement learning engineer wants to add an extra reward signal, denoted as a function f, to an agent's learning process to encourage more efficient exploration. They have access to a function Φ(s) which provides a numerical estimate of a state's value, and a discount factor γ. To guarantee that this additional reward signal does not alter the agent's optimal long-term behavior, which of the following structures must the function f have for a transition from state s_t to s_{t+1}?
Analyzing a Flawed Reward Shaping Implementation
Validating a Potential-Based Shaping Function
Learn After
Advantage Function as a Form of Shaped Reward
Calculating a Shaped Reward
An agent is being trained using value-based reward shaping. In a particular transition from state s_t to s_{t+1}, the agent receives an environmental reward r of 0. The agent's current value function estimates that the value of the next state, V(s_{t+1}), is substantially higher than the value of the current state, V(s_t). Based on the formula r' = r + γV(s_{t+1}) - V(s_t), what is the most likely consequence of this shaping on the agent's learning for this specific transition?
Analyze the value-based reward shaping formula, r' = r + γV(s_{t+1}) - V(s_t), by matching each component to its specific role or definition within the general structure of potential-based reward shaping.
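The guarantee behind these questions is that potential-based shaping terms of the form γΦ(s_{t+1}) - Φ(s_t) telescope over a trajectory: their discounted sum depends only on the start and end states, not on the path taken, so the ranking of policies by return is unchanged. A small numerical check, using made-up Φ values along a hypothetical trajectory:

```python
gamma = 0.9
phi = [1.0, 3.0, 2.0, 5.0]  # hypothetical Phi(s_t) along one trajectory

# Discounted sum of the shaping terms F_t = gamma*Phi(s_{t+1}) - Phi(s_t)
shaping_sum = sum(gamma**t * (gamma * phi[t + 1] - phi[t])
                  for t in range(len(phi) - 1))

# Telescoping prediction: gamma^T * Phi(s_T) - Phi(s_0)
T = len(phi) - 1
predicted = gamma**T * phi[T] - phi[0]

print(abs(shaping_sum - predicted) < 1e-9)  # True
```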