Learn Before
Condition for Policy Invariance in Reward Shaping
When using a transformed reward function, the choice of the shaping reward function is critical for preserving the original optimal policy. To guarantee that the agent's optimal behavior is unchanged, the shaping function cannot be an arbitrary addition: it must take a specific, constrained form, namely the potential-based form built from a potential function over states.
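As a minimal sketch of that constrained form (the potential-based formula covered under Learn After, F(s, a, s') = γΦ(s') − Φ(s)), assuming a discrete state space; the discount factor `GAMMA` and the potential table `phi` below are illustrative values, not taken from this card:

```python
GAMMA = 0.9                      # discount factor (assumed for illustration)
phi = {"s0": 0.0, "s1": 3.0}     # potential function over states (assumed values)

def shaping_reward(s: str, s_next: str) -> float:
    """F(s, a, s') = gamma * Phi(s') - Phi(s): the constrained form that
    provably preserves the original optimal policy."""
    return GAMMA * phi[s_next] - phi[s]

def shaped_reward(r: float, s: str, s_next: str) -> float:
    """Transformed reward R'(s, a, s') = R(s, a, s') + F(s, a, s')."""
    return r + shaping_reward(s, s_next)

print(shaped_reward(-0.5, "s0", "s1"))  # -0.5 + (0.9 * 3.0 - 0.0) = 2.2
```

Because F depends only on the potentials of the two states (and the discount), the shaping terms telescope along any trajectory, which is why this form leaves the optimal policy intact.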
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Condition for Policy Invariance in Reward Shaping
A reinforcement learning agent is operating in an environment where taking a specific action in a given state results in a transition to a new state. The environment's original reward for this transition is -0.5. To guide the agent more effectively, a shaping function is added, which provides an additional reward value of +2.0 for this same transition. According to the standard formulation for reward shaping, what is the total transformed reward the agent receives?
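Under the standard additive formulation referenced in the question, the transformed reward is the sum of the original reward and the shaping reward:

R'(s, a, s') = R(s, a, s') + F(s, a, s') = -0.5 + 2.0 = +1.5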
Deconstructing a Shaped Reward Function
Analyzing Reward Components in a Maze Navigation Task
Learn After
Potential-Based Shaping Function Formula
Analysis of a Flawed Reward Shaping Implementation
A reinforcement learning agent is being trained to navigate a maze. The original reward function provides a large positive reward only upon reaching the exit. To speed up learning, a developer adds a shaping reward function that gives a small, constant positive reward for every single action the agent takes, regardless of the state. After this change, the agent learns a new policy of moving in a perpetual loop instead of solving the maze. Why did adding this specific shaping reward alter the optimal policy?
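A back-of-the-envelope sketch illustrates the failure mode: a constant per-step bonus is not potential-based, and once the discounted sum of bonuses from looping forever exceeds the value of reaching the exit, looping becomes the new optimal policy. All numeric values below (`GAMMA`, `C`, `R_EXIT`, and the 20-step solve) are illustrative assumptions, not taken from this card:

```python
GAMMA = 0.99   # discount factor (assumed)
C = 1.0        # constant shaping bonus paid on every action (assumed)
R_EXIT = 10.0  # original reward for reaching the exit (assumed)

def return_solve(k: int) -> float:
    """Shaped return for solving the maze in k steps:
    the bonus c on each step, plus the exit reward on the final step."""
    return sum(C * GAMMA**t for t in range(k)) + GAMMA**(k - 1) * R_EXIT

def return_loop() -> float:
    """Shaped return for looping forever: the geometric series c / (1 - gamma)."""
    return C / (1 - GAMMA)

print(return_solve(20))  # ~26.5
print(return_loop())     # 100.0 -- looping outscores solving, so the policy changes
```

A potential-based shaping term would have telescoped to zero around any loop, so no cycle could accumulate net bonus; the constant bonus violates exactly that property.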
Critique of an Arbitrary Shaping Function