Learn Before
Concept

DDPG

DDPG is a milestone work for reinforcement learning as well. The major contribution of DDPG is that it introduces the actor-critic method, which directly output an action instead of searching through an action. But it uses the experience reply and fixed Q-target as well, which is a a great evidence of the importance of these two tricks as well. DDPG makes the application of reinforcement learning in nlp, recommender system or other practical commercial area achievable, although there are still many problems existing now.

0

3

Updated 2021-03-21

Tags

Data Science