LectureBank-Full Configuration Used in Hierarchical Prerequisite RAG (208 Concepts, 899 Edges, 1,421 QA)
In this paper, LectureBank-Full is consumed as a prerequisite graph with 208 concepts, 899 prerequisite edges, and 1,421 QA pairs derived from instructional materials. Graph descriptors are computed from the released graph and concept texts: mean depth, the fraction of nodes at a given depth value, mean degree, and mean concept-text length in tokens. The dataset is one of the two curated prerequisite benchmarks used for headline retrieval comparisons.
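As a rough illustration of how descriptors of this kind could be computed, the sketch below works on a four-concept toy graph, not on LectureBank-Full itself. The function name `graph_descriptors`, the edge orientation (prerequisite, dependent), the definition of depth as the longest prerequisite chain above a node, the default depth value of 0, and whitespace tokenization are all assumptions made for the example; the paper's exact definitions may differ.

```python
from collections import defaultdict, deque


def graph_descriptors(concept_texts, edges, depth_value=0):
    """Compute simple descriptors of a prerequisite DAG.

    concept_texts: dict mapping concept name -> concept text.
    edges: list of (prerequisite, dependent) pairs over those concepts.
    depth_value: depth whose node fraction is reported (assumed, not from the paper).
    """
    parents = defaultdict(set)
    children = defaultdict(set)
    for pre, dep in edges:
        parents[dep].add(pre)
        children[pre].add(dep)

    nodes = list(concept_texts)
    # Assumption: depth = length of the longest prerequisite chain above a node,
    # so concepts with no prerequisites have depth 0.
    remaining = {n: len(parents[n]) for n in nodes}
    depth = {n: 0 for n in nodes}
    queue = deque(n for n in nodes if remaining[n] == 0)
    visited = 0
    while queue:  # Kahn-style topological traversal
        node = queue.popleft()
        visited += 1
        for child in children[node]:
            depth[child] = max(depth[child], depth[node] + 1)
            remaining[child] -= 1
            if remaining[child] == 0:
                queue.append(child)
    if visited != len(nodes):
        raise ValueError("prerequisite graph contains a cycle")

    n = len(nodes)
    return {
        "mean_depth": sum(depth.values()) / n,
        "frac_at_depth": sum(1 for d in depth.values() if d == depth_value) / n,
        # Degree counts incoming plus outgoing prerequisite edges per concept.
        "mean_degree": sum(len(parents[c]) + len(children[c]) for c in nodes) / n,
        # Assumption: tokens are whitespace-separated words.
        "mean_text_tokens": sum(len(t.split()) for t in concept_texts.values()) / n,
    }


# Toy data for illustration only.
toy_texts = {
    "sets": "collections of distinct objects",
    "functions": "mappings between sets",
    "graphs": "nodes joined by edges",
    "trees": "connected acyclic graphs",
}
toy_edges = [("sets", "functions"), ("sets", "graphs"), ("graphs", "trees")]
toy = graph_descriptors(toy_texts, toy_edges)
print(toy)
```

On the toy graph, one root concept out of four gives a depth-0 fraction of 0.25, and three edges over four concepts give a mean degree of 1.5. A shortest-path definition of depth or a subword tokenizer would change the reported values, so these choices should be checked against the paper before comparing numbers.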
Tags
Science
Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls
Related
LectureBank-Full R@10 Gain from Diffusion and Role-Aware Quotas
LectureBank-Full Target-Disjoint R@10 Result (n=164): Diffusion Gain Survives, Adaptive Tied
LectureBank-Full Generation Diagnostic: Token-F1 1.9 → 18.3, EM Stays 0.0
LectureBank-Full Error Taxonomy: Residual Misses Are Near-Misses Along the Local Prerequisite Graph
LectureBank-Full Paired Delta: Adaptive vs Hierarchical Baseline = +0.7 [-2.1, +3.6]
Token-Cap Comparison on LectureBank-Full: Adaptive Loses More as Cap Tightens
LectureBank-Full Tight-Budget Advantage of Adaptive Depth Gating (Mean ΔR@k = +2.13 over k∈{1,2,3,4})
LectureBank-Full ΔR@k Peaks at k=4 (+6.4 Points, CI [1.0, 11.7])
LectureBank-Full Diffusion Gain over Static Parent Expansion (~18 R@10 Points)
Bounded Held-Out Targets After Strictest Leakage Control (21 LectureBank-Full, 18 MOOC-CS)
LectureBank-Full Decomposition: Diffusion+Quotas Drive ~18 R@10 Points; Contrast Gating Adds At Most ~1 Point (Statistically Tied)
MOOC-CS Configuration Used in Hierarchical Prerequisite RAG (225 Concepts, 516 Edges, 1,016 QA)
Canonical Prerequisite Splits Are Heavily Templated: 92/80 LectureBank-Full and 68/60 MOOC-CS Train-Test Overlaps
QASC Rebuilt as Directed Science-Fact Graph (16,444 Nodes, 25,590 Edges) Used Only for Validation Retrieval
Induced HotpotQA Slice as Auxiliary Stress Test in Supplement