Learn Before
DDPG
DDPG is a milestone work in reinforcement learning. Its major contribution is combining the actor-critic architecture with a deterministic policy gradient, so the actor directly outputs a continuous action instead of searching over or taking an argmax across a discrete action set. It also reuses experience replay and target networks from DQN, which is strong evidence of the importance of these two tricks. DDPG helps make reinforcement learning applicable to continuous-control problems and to practical commercial areas such as NLP and recommender systems, although many open problems remain.
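The key structural idea above can be sketched in a few lines: a deterministic actor maps a state directly to a continuous action, and a critic scores the resulting (state, action) pair. This is a toy illustration with made-up linear weights, not DDPG's actual networks or training loop.

```python
import numpy as np

rng = np.random.default_rng(0)

state_dim, action_dim = 4, 2
W_actor = rng.normal(size=(action_dim, state_dim))      # toy linear actor weights
W_critic = rng.normal(size=(state_dim + action_dim,))   # toy linear critic weights

def actor(state):
    # Outputs an action vector directly -- no argmax over a discrete action set.
    return np.tanh(W_actor @ state)  # bounded continuous action in [-1, 1]^action_dim

def critic(state, action):
    # Q(s, a): scalar value estimate for the state-action pair the actor chose.
    return float(W_critic @ np.concatenate([state, action]))

s = rng.normal(size=state_dim)
a = actor(s)        # continuous action, e.g. joint torques or a ranking score
q = critic(s, a)    # critic evaluates that action
```

In full DDPG the critic is trained with a replay buffer and a slowly updated target network, and the actor is updated by following the gradient of the critic's Q-value with respect to the action.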
Tags
Data Science
Related
Pros and Cons of Actor-Critic Method
DQN
DDPG
Role of the Critic in Advantage Function Calculation
Robotic Chef Learning Paradigm
An autonomous agent is at a specific position in a grid world and must choose one of four directions to move (up, down, left, right). A purely value-based agent would estimate the long-term value of moving in each of the four directions and deterministically choose the direction with the highest estimated value. How does the decision-making process of an agent using an actor-critic method fundamentally differ in this same situation?
Definition of the Advantage Function
Training of Reward Models
In a reinforcement learning framework that separates the decision-making process from the evaluation process, there are two key components. Match each component to its primary function and the nature of its output.
Advantage Actor-Critic (A2C) Method