1Cademy - Calculating Future Return

Learn Before

Sum of Future Rewards Notation

Short Answer

Calculating Future Return

An agent is interacting with an environment. At the current time step, t=1, it observes a sequence of rewards for the remainder of the episode, which ends at T=4. The rewards are as follows: r_1 = -1, r_2 = 5, r_3 = 0, r_4 = 10. Calculate the value of the future return, represented by the notation $\sum_{k=1}^{4} r_k$ , and briefly explain what this calculated value signifies for the agent's experience from time step 1 onwards.

Updated 2025-10-09

Contributors are:

Who are from:

Learn Before

Related