Concept

High-Bandwidth GPU Memory Technologies

Because GPUs possess significantly more processing elements than CPUs, they require vastly higher memory bandwidth to avoid starving their compute units. To satisfy this demand, GPU architectures employ two primary strategies. First, they utilize much wider memory buses, such as the 352352-bit-wide bus found on NVIDIA’s RTX 2080 Ti. Second, they rely on specialized high-performance memory chips. Consumer-grade accelerators typically use GDDR6 modules, offering over 500500 GB/s of aggregate bandwidth. In contrast, high-end server accelerators, like the NVIDIA Volta V100, use High Bandwidth Memory (HBM). HBM modules connect directly to the GPU on a dedicated silicon wafer, delivering massive speed but significantly increasing manufacturing costs. Consequently, while GPU memory is functionally similar to CPU memory, it is substantially faster but generally much smaller in capacity.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L