Learn Before
Concept
Markov Process
A Markov Process is a tuple (S,P), where • S is a (finite) set of states • P is a state transition probability matrix. .
We make some constrains on states. A sequence of states is Markov if and only if the probability of moving to the next state depends only on the present state and not on the previous states . That is, for all t, In reinforcement learning, Markov Process is time-homogeneous. That is, the probability of the transition is independent of t: .
0
1
Updated 2025-08-31
Tags
Data Science