Learn Before
Concept

Optimal Step Size according to Taylor Series Approximation

Denote the function as f(x)f(x), gg is the gradient and HH is is the Hessian at x(0)x^{(0)}. We calculate the new point x=x(0)ϵgx = x^{(0)} - \epsilon g. We can obtian that f(x(0)ϵg)f(x(0))ϵgTg+12ϵ2gTHgf(x^{(0)} - \epsilon g) \approx f(x^{(0)}) - \epsilon g^Tg + \frac{1}{2} \epsilon^2 g^THg According to the above equation, the optimal step size when gTHgg^THg is positive is ϵ=gTggTHg\epsilon^* = \frac{g^Tg}{g^THg}

0

1

Updated 2021-05-24

References


Tags

Data Science