Learn Before
Concept
Preference for Stochastic Gradient Descent with Large Datasets
As the number of examples in a training dataset increases, the computational cost of performing a single iteration of standard gradient descent grows significantly. Due to this high cost, stochastic gradient descent is generally preferred for large datasets.
0
1
Updated 2026-05-15
Tags
D2L
Dive into Deep Learning @ D2L