1Cademy - Reward Shaping Formula

Learn Before

Reward Shaping as a Solution for Sparse Rewards

Formula

Reward Shaping Formula

In reward shaping, a new reward signal, known as the transformed reward function $r'(s_t, a_t, s_{t+1})$ , is created by adding a shaping reward function $f(s_t, a_t, s_{t+1})$ to the environment's original reward function $r(s_t, a_t, s_{t+1})$ . This relationship is expressed by the formula: $r'(s_t, a_t, s_{t+1}) = r(s_t, a_t, s_{t+1}) + f(s_t, a_t, s_{t+1})$ This technique provides an agent with additional feedback, where all functions depend on the current state $s_t$ , action $a_t$ , and next state $s_{t+1}$ .