Learn Before
Formula
Hessian-Vector Product Formula
When a model has millions of parameters, the full Hessian matrix is computationally expensive to calculate and store. Krylov methods offer an alternative optimization approach by only requiring the product between the Hessian and an arbitrary vector. For a function f:mathbb{R}^nrightarrow mathbb{R} with a Hessian and an arbitrary vector , this Hessian-vector product can be evaluated using only gradient operations: .
0
1
Updated 2026-06-14
Tags
Data Science