Learn Before
MOOC-CS: Language-Matched Controls (Results) in Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls
Method Part 1: Multilingual Encoder + CJK Query Rewrite as a MOOC-CS Control (Auditable Strict-Parity Graph-RAG Paper)
Method Part 2: MOOC-CS Prerequisite Benchmark (Auditable Strict-Parity Graph-RAG Paper)
Multilingual Encoder Alone Does Not Improve MOOC-CS Recall (Hierarchical R@10 = 22.3 vs 23.1)
On MOOC-CS with the English-templated queries kept fixed, switching the dense encoder from English MiniLM to a multilingual model does not improve recall: hierarchical Recall@10 moves from 23.1 to 22.3, and Adaptive (heuristic) also drops. The result isolates the effect of encoder choice alone under the unchanged English template interface and shows that a multilingual encoder, on its own, does not address the mismatch between English templates and Chinese concept names.
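To make the comparison concrete, the following is a minimal sketch of how a Recall@10 figure of this kind could be computed and how the reported gap works out. The function names, the macro-averaging over queries, and the toy inputs are assumptions for illustration; the paper's exact Recall@k definition and evaluation code are not given here. Only the two values 23.1 and 22.3 come from this card.

```python
from typing import Sequence, Set


def recall_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 10) -> float:
    """Fraction of the relevant items that appear in the top-k of one ranking."""
    if not relevant:
        return 0.0
    hits = len(set(ranked[:k]) & relevant)
    return hits / len(relevant)


def mean_recall_at_k(
    rankings: Sequence[Sequence[str]],
    gold: Sequence[Set[str]],
    k: int = 10,
) -> float:
    """Macro-average of per-query Recall@k, in percent (assumed aggregation)."""
    scores = [recall_at_k(r, g, k) for r, g in zip(rankings, gold)]
    return 100.0 * sum(scores) / len(scores)


# Hierarchical Recall@10 values reported on this card (percent).
minilm_r10 = 23.1        # English MiniLM encoder, English-templated queries
multilingual_r10 = 22.3  # multilingual encoder, same English-templated queries

# Change attributable to swapping the encoder alone, in percentage points.
delta = round(multilingual_r10 - minilm_r10, 1)
```

Under these assumptions, `delta` is -0.8 percentage points: the encoder swap alone leaves hierarchical recall slightly lower rather than higher.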
Tags
Science
Auditable Strict-Parity Evaluation of Prerequisite-Graph Retrieval for RAG under Leakage Controls
Related
Template Stripping on MOOC-CS Raises Hierarchical R@10 from 23.1 to 26.5 (MiniLM Encoder)
Multilingual Encoder Alone Does Not Improve MOOC-CS Recall (Hierarchical R@10 = 22.3 vs 23.1)
Multilingual Encoder + CJK-Only Queries Jumps MOOC-CS Hierarchical R@10 to 68.1
Graph Effect on MOOC-CS Is Conditional on Dense Seed Pool Quality
MiniLM Encoder + CJK-Only Queries on MOOC-CS: Hierarchical R@10 Rises from 23.1 to 26.5 with Flat Dense at 21.7
MOOC-CS Error Taxonomy: Residual Failures Dominated by Distant Misses and Bilingual Aliasing
Language-Matched Seeding as a Prerequisite for Graph-Expansion Gains