Example

Canonical Prerequisite Splits Are Heavily Templated: 92/80 LectureBank-Full and 68/60 MOOC-CS Train-Test Overlaps

The canonical prerequisite splits used to train and evaluate prerequisite-QA systems are heavily templated, which produces measurable train-test leakage. On LectureBank-Full, the canonical train and test splits share 92 exact questions and 80 target concepts. On MOOC-CS, they share 68 exact questions and 60 target concepts. These overlap counts motivate the paper's question-disjoint and target-concept-disjoint controls: without them, headline retrieval numbers on the canonical splits would partially reflect train-test template reuse rather than generalization.
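To make the two overlap counts concrete, here is a minimal sketch of how such counts could be computed for any pair of splits. It is not the paper's audit code. The `Example` record, its field names, the `normalize` helper, and the choice to count unique strings after case and whitespace normalization are all assumptions made for illustration. The filtering function is likewise only one possible way to obtain a test set that is both question-disjoint and target-concept-disjoint from training; the paper may construct its controls differently.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Example(NamedTuple):
    """One prerequisite-QA item (hypothetical schema, not from the paper)."""
    question: str        # question text
    target_concept: str  # concept whose prerequisites are being asked about


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (assumed matching rule)."""
    return " ".join(text.lower().split())


def overlap_counts(train: Iterable[Example],
                   test: Iterable[Example]) -> Tuple[int, int]:
    """Return (shared unique questions, shared unique target concepts)."""
    train, test = list(train), list(test)
    train_q = {normalize(e.question) for e in train}
    test_q = {normalize(e.question) for e in test}
    train_c = {normalize(e.target_concept) for e in train}
    test_c = {normalize(e.target_concept) for e in test}
    return len(train_q & test_q), len(train_c & test_c)


def disjoint_test_split(train: Iterable[Example],
                        test: Iterable[Example]) -> List[Example]:
    """Keep only test items whose question AND target concept are unseen in train."""
    train = list(train)
    seen_q = {normalize(e.question) for e in train}
    seen_c = {normalize(e.target_concept) for e in train}
    return [
        e for e in test
        if normalize(e.question) not in seen_q
        and normalize(e.target_concept) not in seen_c
    ]
```

Under these assumptions, a nonzero first count signals template reuse at the question level, and a nonzero second count signals that a system can score well on test items simply by having seen the same target concept during training.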

Updated 2026-05-17

Tags: Science

Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls