Learn Before
Concept

PCI Express (PCIe) in Deep Learning

The Peripheral Component Interconnect Express (PCIe) is a dedicated high-speed expansion bus that forms point-to-point connections between the CPU and critical components like GPUs, network interfaces, and storage. It offers massive bandwidth, such as up to 3232 GB/s on PCIe 4.0 in a 1616-lane slot, with single-digit microsecond latency (e.g., 55 μs). However, PCIe links are a precious resource because processors only support a limited number of lanes. For instance, AMD's EPYC 3 has 128128 lanes, Intel's Xeon has up to 4848 lanes per chip, while desktop-grade CPUs like the Ryzen 9 and Core i9 have 2020 and 1616 lanes, respectively. Because GPUs typically require 1616 lanes each and must share the bus with other bandwidth-intensive peripherals, the number of GPUs that can connect to the CPU at full speed is inherently limited. To minimize packet overhead during transmission across these limited links, deep learning systems prefer large bulk data transfers.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L