Learn Before
Example
Newton's Method on a Nonconvex Function
A critical flaw of standard Newton's Method emerges when optimizing nonconvex functions. The algorithm's update rule involves dividing by the Hessian matrix (or the second derivative in one dimension). If the objective function exhibits negative curvature—meaning the second derivative is negative, as seen in functions like —the update step can inadvertently move the parameters in a direction that increases the function's value, rather than minimizing it.
0
1
Updated 2026-05-15
Tags
D2L
Dive into Deep Learning @ D2L