Concept
NVLink Connectivity in Multi-GPU Servers
Modern deep learning hardware often features bespoke network connectivity to handle large data transfers efficiently. For example, in a multi-GPU server, each GPU typically connects to a host CPU via a PCIe link. In addition, each GPU may have multiple NVLink connections to other GPUs, each capable of transferring data bidirectionally at high speed. Because the aggregate NVLink bandwidth significantly exceeds the PCIe bandwidth, maximizing training efficiency requires specialized synchronization protocols that exploit this hardware architecture.
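The bandwidth comparison above can be made concrete with a little arithmetic. The following is a minimal sketch, not a measurement: the function names are hypothetical, and the PCIe rate, NVLink rate, link count, and gradient size are placeholder values chosen for illustration rather than figures taken from the text. It assumes the NVLink rate is quoted in Gbit/s for both directions combined, so it is halved and divided by 8 to obtain GB/s per direction.

```python
def per_direction_gbyte(bidirectional_gbit_per_s):
    """Convert a bidirectional link rate in Gbit/s to GB/s per direction."""
    one_way_gbit = bidirectional_gbit_per_s / 2.0  # split across two directions
    return one_way_gbit / 8.0                      # 8 bits per byte


def aggregate_nvlink_gbyte(num_links, bidirectional_gbit_per_s):
    """Total one-way NVLink bandwidth of a GPU that has num_links links."""
    return num_links * per_direction_gbyte(bidirectional_gbit_per_s)


def transfer_time_s(size_gbyte, bandwidth_gbyte_per_s):
    """Idealized transfer time, ignoring latency and protocol overhead."""
    return size_gbyte / bandwidth_gbyte_per_s


# Assumed, illustrative hardware numbers (not taken from the text).
PCIE_GBYTE_PER_S = 16.0          # one PCIe link between GPU and host CPU
NVLINK_BIDIR_GBIT_PER_S = 300.0  # one NVLink connection, both directions
NVLINKS_PER_GPU = 6              # NVLink connections per GPU
GRADIENT_SIZE_GBYTE = 0.16       # hypothetical gradient payload

nvlink_total = aggregate_nvlink_gbyte(NVLINKS_PER_GPU, NVLINK_BIDIR_GBIT_PER_S)
ratio = nvlink_total / PCIE_GBYTE_PER_S

print(f"NVLink per link, per direction: "
      f"{per_direction_gbyte(NVLINK_BIDIR_GBIT_PER_S):.2f} GB/s")
print(f"Aggregate NVLink per GPU: {nvlink_total:.1f} GB/s")
print(f"PCIe per GPU: {PCIE_GBYTE_PER_S:.1f} GB/s")
print(f"Aggregate NVLink / PCIe: {ratio:.1f}x")
print(f"Gradient over PCIe: "
      f"{transfer_time_s(GRADIENT_SIZE_GBYTE, PCIE_GBYTE_PER_S) * 1e3:.1f} ms")
print(f"Gradient over all NVLinks: "
      f"{transfer_time_s(GRADIENT_SIZE_GBYTE, nvlink_total) * 1e3:.2f} ms")
```

Under these assumed values a single NVLink connection is only modestly faster than the PCIe link in each direction. The advantage comes from using several links at once, which is why a synchronization protocol needs to spread traffic across all of a GPU's NVLink connections instead of routing everything through the host.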
Updated 2026-05-18
Tags
D2L
Dive into Deep Learning @ D2L