Learn Before
Concept
The Problem With Q-learning
Previously attributed to insufficiently flexible function approximation and noise. However, overestimations can occur regardless of the source of approximation error. Since imprecise value estimates are often used for learning, overestimations may be much more common.
0
2
Updated 2021-08-12
Contributors are:
Who are from:
Tags
Data Science