Learn Before
Concept

The Problem With Q-learning

Previously attributed to insufficiently flexible function approximation and noise. However, overestimations can occur regardless of the source of approximation error. Since imprecise value estimates are often used for learning, overestimations may be much more common.

0

2

Updated 2021-08-12

Tags

Data Science