Learn Before
Concept

PCIe Multiplexer in Multi-GPU Servers

In multi-GPU systems, central processing units (CPUs) often possess an insufficient number of direct PCIe lanes to establish individual connections with every Graphics Processing Unit (GPU). For example, consumer-grade CPUs may only support 2424 lanes, while each GPU typically requires 1616 lanes for optimal throughput. To overcome this hardware limitation, systems employ a PCIe multiplexer, or switch. This switch acts as an intermediary hub that facilitates full-bandwidth communication directly between the connected devices. Because each GPU connects to the multiplexer at high speeds (e.g., 1616 GB/s on a 16x Gen3 link), the switch enables simultaneous, device-to-device data transfers that are significantly more efficient than routing all communication through the CPU.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L