Learn Before
Concept

Intuition behind Gradient Descent with Momentum

As shown in the picture below, for Gradient descent optimizer, we will have ups and downs in the vertical direction, but it continues to go right in the horizontal direction. By taking the average of the few previous gradients, you will decrease oscillations in the vertical direction by averaging out positive and negative values. And since all gradients point to the same direction horizontally, the result in the horizontal direction will remain a large value in the right direction.

Image 0

0

3

Updated 2021-03-31

Tags

Data Science