Learn Before
Concept

Model-Based Methods for Deep Reinforcement Learning

When a model of the environment ( (the estimated transition function and the estimated reward function) is available, the model can then act as a proxy for the actual environment.

  • In many games, the rule of the game is the model.
  • In other cases, it can be the law of Physics. Sometimes, we know how to model it and build simulators for it.

We can define this model with rules or equations. Or, we can use the Gaussian Process, Gaussian Mixture Model (GMM) or deep networks. To fit these models, we run a controller to collect sample trajectories and train the models with supervised learning.

0

1

Updated 2020-10-17

Tags

Data Science