Learn Before
PCIe Multiplexer in Multi-GPU Servers
In multi-GPU systems, the CPU often has too few PCIe lanes to connect directly to every GPU. For example, a consumer-grade CPU may expose only around 16 to 24 lanes, while each GPU ideally uses 16 lanes for full throughput. To work around this limit, systems place a PCIe multiplexer, or switch, between the CPU and the GPUs. The switch acts as an intermediary hub that lets the connected devices communicate with each other directly at full link bandwidth. Each GPU connects to the switch at the same speed as the CPU's own link (roughly 16 GB/s per direction on an x16 Gen3 link), so simultaneous device-to-device transfers through the switch are considerably more efficient than routing all communication through the CPU.
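The lane and bandwidth arithmetic behind this can be sketched in a few lines of Python. This is a minimal illustration, not a model of any particular server: the function names are hypothetical, the 24-lane CPU and 4-GPU count are assumed example values, and the per-lane figures (8 GT/s for Gen3, 16 GT/s for Gen4, both with 128b/130b encoding) are the nominal rates, ignoring protocol overhead.

```python
# Nominal per-lane transfer rate (GT/s) and line-encoding efficiency.
# Gen3 and Gen4 both use 128b/130b encoding.
PCIE_GENERATIONS = {
    3: (8.0, 128 / 130),
    4: (16.0, 128 / 130),
}


def link_bandwidth_gb_per_s(generation: int, lanes: int) -> float:
    """Approximate one-directional bandwidth of a PCIe link in GB/s."""
    rate_gt_per_s, efficiency = PCIE_GENERATIONS[generation]
    # One transfer carries one bit per lane; divide by 8 to get bytes.
    return rate_gt_per_s * efficiency / 8 * lanes


def lane_shortfall(cpu_lanes: int, num_gpus: int, lanes_per_gpu: int = 16) -> int:
    """Lanes missing if every GPU were wired directly to the CPU."""
    return max(0, num_gpus * lanes_per_gpu - cpu_lanes)


if __name__ == "__main__":
    # Assumed example: a 24-lane CPU and four GPUs wanting x16 each.
    print(f"x16 Gen3 link: {link_bandwidth_gb_per_s(3, 16):.2f} GB/s per direction")
    print(f"Lane shortfall: {lane_shortfall(cpu_lanes=24, num_gpus=4)}")
```

With these assumed values, four GPUs would need 64 lanes against 24 available, which is the gap a switch is meant to cover.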
Tags
D2L
Dive into Deep Learning @ D2L