Learn Before
Concept

Back-Propagating through Discrete Stochastic Operations

The cost function for machine learning models having discrete outputs are not differentiable at certain points, in order to mitigate this effect expectation of the cost function is chosen for gradient descent algorithm and this is idea convinced by the "REINFORCE (REward Increment = nonnegative Factor×Offset Reinforcement×Characteristic Eligibility)"algorithm

0

1

Updated 2021-07-29

References


Tags

Data Science

Related