Learn Before
Concept
Policy-Based Methods for Deep Reinforcement Learning
Policy-based methods learn the policy function directly, without calculating a value function for each action. These methods use a learning signal derived from sampling instantiations of policy parameters and the set of policies is developed towards policies that achieve better returns.
An example of a policy-based algorithm is Policy Gradient Methods.
0
1
Updated 2020-10-17
Tags
Data Science