Learn Before
Example

One-Dimensional Gradient Descent on a Quadratic

To illustrate one-dimensional gradient descent concretely, consider minimizing the objective function f(x)=x2f(x) = x^2, whose derivative is f(x)=2xf'(x) = 2x. Although the minimum at x=0x = 0 is known analytically, applying gradient descent with an initial value of x=10x = 10 and a learning rate of η=0.2\eta = 0.2 demonstrates how the iterative update xxη2xx \leftarrow x - \eta \cdot 2x drives xx toward the optimum. After 1010 iterations, xx reaches approximately 0.06050.0605, confirming that the algorithm steadily reduces the function value and converges close to the true minimum.

0

1

Updated 2026-05-15

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L