Learn Before
  • Reward in Reinforcement Learning

Bellman Equation

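To understand how reward is used in reinforcement learning, it helps to start from the mathematics behind it. The basic intuition comes from the Bellman Equation, which expresses the value of a state, v(s), as the expected immediate reward plus the discounted value of the state that follows. The goal of the agent is to maximize this expected cumulative reward. Here R_{t+1} is the reward received after leaving state S_t, S_{t+1} is the next state, and \lambda is the discount factor (between 0 and 1) that weights future reward against immediate reward. The Bellman Equation is:

$$v(s) = \mathbb{E}[R_{t+1} + \lambda v(S_{t+1}) \mid S_t = s]$$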
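To make the recursion concrete, here is a minimal sketch that applies the Bellman Equation repeatedly as an update rule until the values stop changing (often called iterative policy evaluation). The three-state process, its probabilities and rewards, and the names `TRANSITIONS` and `evaluate_values` are all made up for illustration; they are assumptions, not part of the original note.

```python
# Hypothetical toy process: each state maps to a list of
# (probability, next_state, reward) outcomes. "end" is absorbing with zero reward.
TRANSITIONS = {
    "A": [(1.0, "B", 1.0)],
    "B": [(0.5, "A", 0.0), (0.5, "end", 2.0)],
    "end": [(1.0, "end", 0.0)],
}


def evaluate_values(transitions, discount=0.9, tol=1e-10, max_sweeps=10_000):
    """Estimate v(s) by sweeping v(s) <- E[R + discount * v(S')] until it settles."""
    values = {state: 0.0 for state in transitions}
    for _ in range(max_sweeps):
        max_change = 0.0
        for state, outcomes in transitions.items():
            # Right-hand side of the Bellman Equation for this state.
            new_value = sum(
                prob * (reward + discount * values[next_state])
                for prob, next_state, reward in outcomes
            )
            max_change = max(max_change, abs(new_value - values[state]))
            values[state] = new_value
        if max_change < tol:
            break
    return values


values = evaluate_values(TRANSITIONS, discount=0.9)
for state, value in values.items():
    print(f"v({state}) = {value:.4f}")
```

Because the discount factor is below 1, each sweep shrinks the error, so the loop settles on values that satisfy the equation for every state at once. In this toy setup the `discount` argument plays the role of \lambda in the formula above.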

4 years ago

Tags

Data Science

Related
  • Reward vs. Value Function

  • Rewards, Returns and Value functions

  • Why Function Approximation is Needed?

  • Bellman Equation