Learn Before
Concept
CPU Cache Latencies
In a standard CPU architecture, memory access latency increases as data moves further from the execution cores. An L1 cache reference is extremely fast, taking approximately ns (roughly cycles). If data is not in the L1 cache, an L2 cache hit takes about ns ( to cycles). Retrieving data from the L3 cache incurs higher latency, ranging from ns ( cycles) for an unshared cache, to – ns if shared or modified in another core, and up to ns ( to cycles) if the cache is located on a remote CPU socket.
0
1
Updated 2026-05-18
Tags
D2L
Dive into Deep Learning @ D2L