Learn Before
Concept

Policy-Based Methods for Deep Reinforcement Learning

Policy-based methods learn the policy function directly, without calculating a value function for each action. These methods use a learning signal derived from sampling instantiations of policy parameters and the set of policies is developed towards policies that achieve better returns.

An example of a policy-based algorithm is Policy Gradient Methods.

0

1

Updated 2020-10-17

Tags

Data Science