Learn Before
Sum of Future Rewards Notation
The notation G_t represents the sum of future rewards, also known as the future return. It is the total reward collected from the current time step t to the end of the episode at the final time step T: G_t = R_t + R_{t+1} + ... + R_T.
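As a minimal sketch of the definition above, the future return can be computed by summing the rewards from time step t through the end of the episode (the helper name `future_return` and the sample reward values are illustrative, not from the course):

```python
def future_return(rewards, t):
    """Return G_t: the sum of rewards from time step t to the final step T."""
    return sum(rewards[t:])

# A 5-step episode (time steps 0 through 4) with illustrative rewards.
rewards = [1, 0, -2, 3, 4]

# Future return from t = 2: (-2) + 3 + 4 = 5
print(future_return(rewards, 2))  # -> 5

# Future return from t = 0 is the total reward of the whole episode.
print(future_return(rewards, 0))  # -> 6
```

Note that G_t = R_t + G_{t+1}, so the return from consecutive time steps differs by exactly the immediate reward R_t.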

Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
State in the Context of LLMs
An autonomous agent is designed to navigate a maze to find a piece of cheese. At any given moment, the agent knows its current coordinates (e.g., row 3, column 5), whether the adjacent squares contain walls or open paths, and the location of the cheese. Based on this information, the agent must decide whether to move up, down, left, or right. Which of the following best describes the agent's 'state' in this scenario?
Defining the State for a Chess-Playing Agent
Designing a State Representation for a Self-Driving Car
Sum of Future Rewards Notation
Learn After
An agent interacts with an environment over an episode that lasts for 5 time steps (from time step 0 to 4). The sequence of rewards received by the agent is: -1, 0, 5, -2, 10. What is the value of the future return, represented by the notation G_t, if the current time step t is 2 and the final time step T is 4?

Consider an agent interacting with an environment over a single episode. The future return is calculated as the sum of all rewards from a specific time step t to the final time step T, represented by the notation G_t. True or False: For any two consecutive time steps t and t+1 within the episode, the future return calculated from t will be greater than the future return calculated from t+1 if and only if the immediate reward received at time step t, denoted as R_t, is positive.

Calculating Future Return