Example

Single vs. Dual GPU Training Comparison on Fashion-MNIST

Training the LeNet model on Fashion-MNIST with a batch size of 256256 and a learning rate of 0.20.2 on a single GPU achieves a test accuracy of approximately 0.830.83 at around 3.03.0 seconds per epoch. When the number of GPUs is increased to 22 while keeping the batch size and learning rate unchanged, the test accuracy remains roughly the same (approximately 0.840.84), confirming that the optimization algorithm is mathematically identical regardless of the number of devices. However, no meaningful speedup is observed because the model is too small and the dataset too limited for multi-GPU parallelism to overcome the significant Python overhead inherent in this from-scratch implementation. More complex models and more sophisticated parallelization approaches are needed to realize tangible performance benefits.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L