Learn Before
Concept

Rewards, Returns and Value functions

Rewards are very important as they are used for estimating value. Estimating rewards are relatively easy compared to calculating values which can be quite challenging. The values are calculated after each time step and after each time step it is important to receive highest value and not the highest reward.

"While return gives the expected discounted sum of rewards for one episode, a value function gives the expected discounted sum of rewards from a certain state"

0

1

Updated 2020-10-27

Tags

Data Science