Double Q-learning

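Double Q-learning maintains two value estimators, Q_A and Q_B, to counter the overestimation bias of standard Q-learning, which arises because the same values are used both to pick and to evaluate the maximizing action. At each step, one of the two estimators is updated (typically chosen at random): it selects the greedy next action from its own values, but the update target uses the other estimator's value for that action. Action selection is still greedy (in practice often ε-greedy), based on the average of the two estimators.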
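
The update and selection rules described here can be sketched for the tabular case as follows. This is a minimal illustration, not a reference implementation: the function names `double_q_update` and `select_action`, the list-of-lists table layout, the coin-flip choice of which estimator to update, and the default values for `alpha`, `gamma`, and `epsilon` are all assumptions made for the example.

```python
import random


def double_q_update(q_a, q_b, state, action, reward, next_state, n_actions,
                    alpha=0.1, gamma=0.99, done=False, rng=random):
    """Update one of the two Q-tables in place (hypothetical helper).

    q_a and q_b are lists indexed as q[state][action].
    """
    # Assumption: a fair coin flip decides which estimator learns this step.
    if rng.random() < 0.5:
        learner, other = q_a, q_b
    else:
        learner, other = q_b, q_a

    if done:
        # No bootstrapping from a terminal state.
        target = reward
    else:
        # The learner picks the greedy next action from its own values...
        best = max(range(n_actions), key=lambda a: learner[next_state][a])
        # ...but that action is evaluated with the other estimator.
        target = reward + gamma * other[next_state][best]

    learner[state][action] += alpha * (target - learner[state][action])


def select_action(q_a, q_b, state, n_actions, epsilon=0.1, rng=random):
    """Epsilon-greedy choice on the average of the two estimators."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    return max(range(n_actions),
               key=lambda a: (q_a[state][a] + q_b[state][a]) / 2)
```

Separating selection (the learner's own values) from evaluation (the other table's value) is what removes the upward bias of taking a maximum over noisy estimates; setting `epsilon=0` recovers purely greedy selection on the averaged values.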

Updated 2021-08-12

Tags

Data Science