CPU-Only Latency Protocol (Apple M4 Max, Caches Disabled, 200 Measured Queries After Warm-Up, Three Repeats)
Latency in this paper is measured under a fixed protocol designed to remove caching and warm-start confounds: Apple M4 Max CPU-only hardware, caches disabled, a warm-up phase followed by measured queries, and three repeats of the full measurement. The reported values are the mean within-repeat per-query standard deviation taken from the cited microbench JSONs, so the dispersion captures run-to-run variation within a repeat rather than across-repeat variance. Reporting latency under this CPU-only, cache-disabled, multi-repeat protocol makes the timings directly comparable across the paper's systems and benchmarks.
0
1
Tags
Science
Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls
Related
Statistical Protocol for Hierarchical Prerequisite Graph RAG: 5,000 Paired Bootstrap Resamples and Holm–Bonferroni
CPU-Only Latency Protocol (Apple M4 Max, Caches Disabled, 200 Measured Queries After Warm-Up, Three Repeats)
Primary and Secondary Retrieval Metrics in Hierarchical Prerequisite Graph RAG