QASC Question Answering Benchmark

QASC (Question Answering via Sentence Composition) is a multi-hop multiple-choice science QA benchmark. It contains 9,980 grade-school-level science questions, each with 8 answer choices, together with an associated corpus of roughly 17 million science sentences. Every question is annotated with two corpus facts that must be retrieved and composed to support the correct answer, and the decomposition into those facts is not evident from the question surface form, so retrieval must introduce new concepts or relations. The standard task is to choose the correct answer given the question and choices, and retrieval systems are evaluated by their ability to recover the two annotated supporting sentences from the corpus, typically with Recall@k over a fixed candidate pool. Headline comparisons fix the encoder, candidate pool, cutoff, matching rule, and split policy so that retrievers (e.g., dense retrievers, ColBERTv2, RePlug) can be compared under matched conditions.
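The retrieval metric described above can be sketched as follows. This is a minimal illustration, not an official QASC scorer. It assumes that corpus sentences carry string IDs, that the matching rule is exact ID equality, and that each question has exactly the two annotated gold facts. The function names (`recall_at_k`, `both_facts_at_k`, `mean_recall_at_k`) and the sample IDs are hypothetical.

```python
from typing import Iterable, Sequence, Tuple


def recall_at_k(ranked_ids: Sequence[str], gold_ids: Sequence[str], k: int) -> float:
    """Fraction of gold supporting sentences that appear in the top-k results.

    Assumption: the matching rule is exact equality of sentence IDs.
    """
    if k <= 0:
        raise ValueError("k must be a positive cutoff")
    gold = set(gold_ids)
    if not gold:
        raise ValueError("each question needs at least one gold fact")
    top_k = set(ranked_ids[:k])
    return len(gold & top_k) / len(gold)


def both_facts_at_k(ranked_ids: Sequence[str], gold_ids: Sequence[str], k: int) -> bool:
    """Stricter variant: True only if every gold fact is inside the top-k."""
    return set(gold_ids) <= set(ranked_ids[:k])


def mean_recall_at_k(
    examples: Iterable[Tuple[Sequence[str], Sequence[str]]], k: int
) -> float:
    """Average Recall@k over (ranked_ids, gold_ids) pairs, one pair per question."""
    scores = [recall_at_k(ranked, gold, k) for ranked, gold in examples]
    if not scores:
        raise ValueError("no examples to score")
    return sum(scores) / len(scores)


# Hypothetical ranking for one question whose gold facts are s1 and s2.
ranked = ["s3", "s1", "s9", "s2"]
gold = ["s1", "s2"]
print(recall_at_k(ranked, gold, k=2))      # only s1 is in the top 2
print(both_facts_at_k(ranked, gold, k=4))  # both facts are in the top 4
```

Reporting both the fractional score and the stricter both-facts variant can be useful for a two-fact benchmark, since recovering only one of the two sentences does not provide the full composition needed to support the answer. Whichever variant is used, the cutoff and matching rule should be held fixed across the retrievers being compared.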

Updated 2026-05-16

Tags

Science
