Learn Before
Formula

Hessian-Vector Product Formula

When a model has millions of parameters, the full Hessian matrix is computationally expensive to calculate and store. Krylov methods offer an alternative optimization approach by only requiring the product between the Hessian and an arbitrary vector. For a function f:mathbb{R}^nrightarrow mathbb{R} with a Hessian H\mathbf{H} and an arbitrary vector vv, this Hessian-vector product can be evaluated using only gradient operations: Hv=x[(xf(x))v]\mathbf{H}v=\nabla_{\mathbf{x}}[(\nabla_{\mathbf{x}}f(x))^{\top}v].

0

1

Updated 2026-06-14

References


Tags

Data Science