Concept

Preference for Stochastic Gradient Descent with Large Datasets

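As the number of examples n in a training dataset increases, the computational cost of a single iteration of standard (full-batch) gradient descent grows linearly with n, because each update requires the gradient of the loss on every example. Stochastic gradient descent instead estimates the gradient from a single randomly sampled example, so the cost of each update does not depend on n. For this reason, stochastic gradient descent is generally preferred for large datasets.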
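
The cost difference can be illustrated with a minimal sketch in Python with NumPy, assuming a least-squares objective on synthetic data. The function names, data, learning rate, and step count below are hypothetical choices made for illustration and do not come from the source. The full gradient touches all n examples on every update, while the stochastic gradient touches one.

```python
import numpy as np


def full_gradient(w, X, y):
    """Gradient of the mean squared-error loss over all n examples (O(n) work)."""
    residual = X @ w - y
    return X.T @ residual / len(y)


def stochastic_gradient(w, X, y, i):
    """Gradient of the squared-error loss on example i only (work independent of n)."""
    return (X[i] @ w - y[i]) * X[i]


def sgd(X, y, lr=0.01, steps=2000, seed=0):
    """Plain SGD with a fixed learning rate (an illustrative assumption)."""
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        i = rng.integers(len(y))  # sample one example uniformly at random
        w -= lr * stochastic_gradient(w, X, y, i)
    return w


# Synthetic, noiseless linear data (hypothetical, for illustration only).
rng = np.random.default_rng(42)
n, d = 1000, 3
X = rng.normal(size=(n, d))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w

w_sgd = sgd(X, y)
print(w_sgd)
```

Averaged over all examples, the stochastic gradient equals the full gradient, so each cheap update is an unbiased estimate of the expensive one. In practice a minibatch of several examples is often used as a middle ground between the two.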

Updated 2026-05-15

Tags

D2L

Dive into Deep Learning @ D2L