Learn Before
Reward Shaping as a Solution for Sparse Rewards
Reward shaping is a technique for addressing the challenge of sparse rewards by giving an agent more frequent, intermediate feedback. As proposed by Andrew Ng and colleagues, it augments the original reward function with a shaping term derived from a potential function that depends only on the state. Because the shaping term is potential-based, the added guidance accelerates learning without changing the optimal policy, helping to avoid failure modes such as aimless iteration under delayed rewards.
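To make this concrete, here is a minimal sketch in Python; the toy chain MDP, potential values, and discount factor are illustrative assumptions, not from the original card. The shaped reward is r'(s, a, s') = r(s, a, s') + γΦ(s') − Φ(s), where the potential Φ assigns a scalar to each state. Summed with discounting along any trajectory, the shaping terms telescope to a quantity that depends only on the trajectory's endpoints, which is why the ranking of policies, and hence the optimal policy, is unchanged.

    # Minimal sketch of potential-based reward shaping (Ng, Harada & Russell, 1999).
    # The toy chain MDP and potential values below are illustrative assumptions.
    GAMMA = 0.9

    def shaping_term(phi, s, s_next):
        """F(s, s') = gamma * Phi(s') - Phi(s); Phi depends only on the state."""
        return GAMMA * phi[s_next] - phi[s]

    # Hypothetical potentials: higher as the agent nears the goal.
    phi = {"s0": 0.0, "s1": 1.0, "s2": 2.0, "goal": 3.0}
    trajectory = ["s0", "s1", "s2", "goal"]

    # Discounted sum of the shaping terms along the trajectory...
    discounted = sum(
        (GAMMA ** t) * shaping_term(phi, trajectory[t], trajectory[t + 1])
        for t in range(len(trajectory) - 1)
    )
    # ...telescopes to a difference of potentials at the endpoints, so every
    # policy's return shifts by the same amount and none gains an advantage.
    closed_form = (GAMMA ** (len(trajectory) - 1)) * phi["goal"] - phi["s0"]
    print(discounted, closed_form)  # both ~2.187, equal up to float error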

Tags
Data Science
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Optimal Reward Problem (ORP)
Abnormal Behavior Types Due to Improper Reward Setting
Reward Construction Direction without a Prior Estimate
Reward Shaping as a Solution for Sparse Rewards
Dense vs. Sparse Rewards
Transforming Sparse Rewards into Dense Supervision Signals
An AI is being trained to generate a multi-paragraph summary of a long document. The AI writes the summary one sentence at a time. A quality score is given only after the entire summary is complete. For each individual sentence generated before the final one, the score is zero. What is the most significant learning difficulty the AI will face due to this scoring method? (A toy illustration of this scoring method appears after this list.)
Training an Agent for a Text-Based Game
Credit Assignment in AI Poetry Generation
Methods for Mitigating Sparse Rewards
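As a toy illustration of the scoring method in the summarization question above (the sentence count, final score, and discount factor are made-up numbers), the snippet below computes the discounted return observed at each sentence position. Every position sees only a discounted copy of the single terminal score, with no signal about which sentences helped or hurt the summary; this is the temporal credit assignment problem.

    # Hypothetical episode: five sentences, zero reward each, quality score at the end.
    rewards = [0.0, 0.0, 0.0, 0.0, 7.5]  # 7.5 is a made-up final quality score
    gamma = 0.95

    # Discounted return G_t seen from each sentence position.
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()

    print(returns)
    # Every entry is just gamma**k * 7.5: the only feedback any sentence
    # receives is the delayed terminal score, so the agent cannot tell
    # which individual sentences improved or degraded the summary.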
Learn After
Reward Shaping Formula
An agent is being trained to navigate a complex maze. It receives a large positive reward (+100) only upon reaching the exit, and a reward of 0 for all other steps. To accelerate learning in this environment with delayed feedback, a developer decides to add an additional, intermediate reward at each step. Which of the following intermediate reward strategies is most likely to guide the agent effectively toward the exit without inadvertently changing the optimal path? (A potential-based sketch for this maze appears after this list.)
Analyzing Reward Shaping Strategies for Text Summarization
A key advantage of implementing a potential-based reward shaping function is that it fundamentally alters the optimal set of actions an agent should take, thereby simplifying complex problems with sparse rewards.
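For the maze-navigation question above, one strategy consistent with this card is a potential-based bonus built from distance to the exit. The sketch below is a hypothetical setup; the 5x5 grid, Manhattan-distance potential, tabular Q-learning, and all parameters are assumptions for illustration. It adds γΦ(s') − Φ(s) with Φ(s) = −(Manhattan distance to the exit) on top of the sparse +100 terminal reward, so each step toward the exit earns immediate positive feedback while the optimal path is preserved. A flat per-step bonus, by contrast, is not potential-based and can make wandering profitable.

    import random

    # Hypothetical 5x5 grid maze; all parameters are illustrative assumptions.
    GAMMA, ALPHA, EPS = 0.95, 0.5, 0.2
    EXIT = (4, 4)
    ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]

    def phi(s):
        """Potential: negative Manhattan distance to the exit (state-only)."""
        return -(abs(s[0] - EXIT[0]) + abs(s[1] - EXIT[1]))

    def step(s, a):
        """Deterministic moves clipped at the walls; sparse reward: +100 at the exit."""
        nxt = (min(4, max(0, s[0] + a[0])), min(4, max(0, s[1] + a[1])))
        return nxt, (100.0 if nxt == EXIT else 0.0), nxt == EXIT

    Q = {}

    def q(s, a):
        return Q.get((s, a), 0.0)

    for _ in range(300):  # episodes
        s = (0, 0)
        for _ in range(200):  # step cap per episode
            a = (random.choice(ACTIONS) if random.random() < EPS
                 else max(ACTIONS, key=lambda x: q(s, x)))
            s2, r, done = step(s, a)
            # Shaped reward: original sparse reward plus gamma*Phi(s') - Phi(s).
            shaped = r + GAMMA * phi(s2) - phi(s)
            target = shaped + (0.0 if done else GAMMA * max(q(s2, x) for x in ACTIONS))
            Q[(s, a)] = q(s, a) + ALPHA * (target - q(s, a))
            s = s2
            if done:
                break

    # After training, the greedy action at the start heads toward the exit.
    print(max(ACTIONS, key=lambda a: q((0, 0), a)))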