1Cademy - One-Dimensional Gradient Descent

Learn Before

Gradient Descent

Concept

One-Dimensional Gradient Descent

One-dimensional gradient descent provides a clear illustration of why moving in the negative gradient direction reduces the objective function. For a continuously differentiable function $f: \mathbb{R} ightarrow \mathbb{R}$ , the first-order Taylor expansion gives $f(x + \epsilon) = f(x) + \epsilon f'(x) + \mathcal{O}(\epsilon^2)$ . Setting the step as $\epsilon = -\eta f'(x)$ , where $\eta > 0$ is a fixed learning rate, yields $f(x - \eta f'(x)) = f(x) - \eta f'^2(x) + \mathcal{O}(\eta^2 f'^2(x))$ . When the derivative $f'(x) eq 0$ , the term $\eta f'^2(x) > 0$ guarantees a decrease in $f$ , provided $\eta$ is small enough for the higher-order terms to be negligible. This leads to the update rule x leftarrow x - eta f'(x), which is applied iteratively from an initial value until a stopping condition is met, such as when the gradient magnitude $|f'(x)|$ becomes sufficiently small or a maximum number of iterations is reached.

0

1

Updated 2026-05-15

Contributors are:

Who are from:

References

Dive into Deep Learning

Learn Before

Related

Learn After