Learn Before
Concept

Pros and Cons of Actor-Critic Method

Advantages: It could use one step adjustment, which leads to much faster training process compared to traditional policy-gradient method. Disadvantages: Actors' actions highly depend on critics' value. However, it's already hard to reach a convergence of the critics. It's even hard to reach the convergence with the actors. If there is no enough data, it may lead to a under-fitting.

0

2

Updated 2021-03-21

Tags

Data Science