Learn Before
Single vs. Dual GPU Training Comparison on Fashion-MNIST
Training the LeNet model on Fashion-MNIST with a batch size of and a learning rate of on a single GPU achieves a test accuracy of approximately at around seconds per epoch. When the number of GPUs is increased to while keeping the batch size and learning rate unchanged, the test accuracy remains roughly the same (approximately ), confirming that the optimization algorithm is mathematically identical regardless of the number of devices. However, no meaningful speedup is observed because the model is too small and the dataset too limited for multi-GPU parallelism to overcome the significant Python overhead inherent in this from-scratch implementation. More complex models and more sophisticated parallelization approaches are needed to realize tangible performance benefits.
0
1
Tags
D2L
Dive into Deep Learning @ D2L