Learn Before
Concept

CPU Cache Latencies

In a standard CPU architecture, memory access latency increases as data moves further from the execution cores. An L1 cache reference is extremely fast, taking approximately 1.51.5 ns (roughly 44 cycles). If data is not in the L1 cache, an L2 cache hit takes about 55 ns (1212 to 1717 cycles). Retrieving data from the L3 cache incurs higher latency, ranging from 1616 ns (4242 cycles) for an unshared cache, to 25252929 ns if shared or modified in another core, and up to 4040 ns (100100 to 300300 cycles) if the cache is located on a remote CPU socket.

0

1

Updated 2026-05-18

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L