Concept

Bellman Equation

It's very important to understand how we define a basic reward function in reinforcement learning and its principia mathematica. The basic intuition of reward fucntion in reinforcement learning is the Bellman Equation, which describes the expected reward. And we want to maximize the expected reward. The Bellman Equation is: v(s)=E[Rt+1+λv(St+1)St=s]v(s) = E[R_{t+1}+\lambda v(S_{t+1})|S_t = s]

0

2

Updated 2025-09-16

Tags

Data Science

Related