Learn Before
Causality in Reinforcement Learning
In reinforcement learning, policy decisions operate under a causality constraint. This means that an action selected at a specific time step t can only impact rewards obtained at or after that time (t' >= t). Rewards received prior to time t are considered unchangeable or 'fixed' from the perspective of the action at t.
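The constraint above is what justifies the "reward-to-go" form of the policy gradient: when crediting the action at step t, only rewards from t onward are summed, since earlier rewards could not have been influenced by it. A minimal sketch in plain Python (no RL library; the helper name `reward_to_go` and the sample reward sequence are illustrative assumptions):

```python
# Sketch of the causality constraint: the return credited to the action
# at step t includes only rewards from step t onward; rewards before t
# are fixed and contribute nothing to that action's credit.

def reward_to_go(rewards, gamma=1.0):
    """For each time step t, return the discounted sum of rewards
    from t to the end of the episode (computed in a backward pass)."""
    returns = [0.0] * len(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

# A 4-step episode with rewards 0, 0, -1, +10 (illustrative values):
print(reward_to_go([0, 0, -1, 10]))  # [9.0, 9.0, 9.0, 10.0]
```

Note that the action at t=3 (0-indexed: the last step) is credited only with +10, while the earlier reward of -1 never counts against it.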
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Theory
Concept
Misinformation
Information Overload
Prototypes
General Knowledge References
Information References
Literacy
The Three Forms of Information
Information Disciplines
Information Dissemination
Distributed Summation Implementation
Vector Transformation Formula
Matrix Bracket Notation
Query, Key, and Value in Attention Mechanisms
Cumulative Future Reward (Return)
Causality in Reinforcement Learning
Less Than Inequality
Average Value Notation ()
Function of a Predicted Future Value Notation ()
Draft Model Probability Distribution ()
Weight Matrix Definition ()
Index Calculation for Sequence Start Position
Sequence of Cyclic Subgroups Notation
Greater Than Inequality
Sequence of Predicted Future Values Notation
Conditional Probability of the Next Element in a Sequence
Weighted Softmax Function Notation
Parameterized Prediction Function Notation ()
Data vs. Information in Model Training
Row Vector Notation ()
A climate scientist reads ten peer-reviewed articles, synthesizes the data and arguments presented, and develops a new, deeper understanding of the acceleration of glacial melt. This new understanding within the scientist's mind best exemplifies which of the following?
Start Index Calculation for a Context Window
Vector Prefix Notation
Sequence of Elements in Angle Brackets Notation
A user asks a large language model to explain a scientific concept. The model retrieves relevant data, synthesizes it, and generates a paragraph as a response. The user reads this paragraph and gains a new understanding. Which part of this scenario best exemplifies 'information-as-process'?
Policy in Reinforcement Learning ()
Probability of a Predicted Future Value Notation ()
Predicted Future Value Notation ()
Uncluttered Notation for Encoder-Classifier Models
Data (Information)
Learn After
Irrelevance of Past Rewards for Policy Gradient Calculation
An autonomous agent completes a task over four time steps. The sequence of actions and resulting rewards is as follows:
- Time t=1: Action a_1 -> Reward r_1 = 0
- Time t=2: Action a_2 -> Reward r_2 = 0
- Time t=3: Action a_3 -> Reward r_3 = -1
- Time t=4: Action a_4 -> Reward r_4 = +10
When evaluating the decision to take action a_2 at time t=2, which rewards should be considered as being potentially influenced by this specific action?
An agent is learning to play a video game. At time step t=5, the agent performs an action (e.g., jumping). According to the causality principle in this context, this specific action at t=5 can alter the reward that was already received at time step t=3.
Causality Principle in Policy Gradient Calculation
Debugging a Policy Update Calculation