Learn Before
Concept
Double Q-learning
Double Q learning introduces double estimator to deal with the over-estimation problem. For update of Q values, both estimators use the value of the other estimator. For selecting options, it's still a greedy algorithm based on the average value of two estimators.

0
2
Updated 2021-08-12
Contributors are:
Who are from:
Tags
Data Science