Learn Before
Graphics Processing Unit (GPU) in Deep Learning
Graphics Processing Units (GPUs) fundamentally transformed deep learning by providing the immense computational throughput required to train deep neural networks. Originally developed to accelerate computer graphics by rapidly performing matrix-vector products, this highly parallel architecture proved perfectly suited for the dense linear algebra operations and convolutions inherent in neural networks. The development of early GPU-accelerated convolution libraries, such as cuda-convnet developed by Alex Krizhevsky and Ilya Sutskever for two NVIDIA GTX 580s, enabled the training of massive models like AlexNet. This hardware breakthrough made deep, data-hungry architectures computationally feasible, igniting the modern deep learning boom.
0
1
Tags
D2L
Dive into Deep Learning @ D2L
Learn After
CPU vs. GPU Architecture in Deep Learning
AlexNet Convolutional Neural Network
cuda-convnet
General-Purpose GPUs (GPGPUs)
GPU Hardware Configurations in Deep Learning
High-Bandwidth GPU Memory Technologies
Hardware Accelerators for Inference
Hardware Accelerators for Training
NVIDIA Collective Communications Library (NCCL)
Parallelization on Multiple GPUs