Learn Before
Concept

Learning Rate in Newton's Method

To mitigate the risk of divergence or moving in the wrong direction on nonconvex surfaces, the standard Newton's Method update can be modified by incorporating a learning rate, η\eta. By taking a fraction of the full calculated step, the algorithm retains the benefits of second-order information—being cautious in regions of high curvature and taking longer steps in flatter regions—while significantly increasing optimization stability.

Image 0

0

1

Updated 2026-05-15

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L