Multiple Choice

An engineer is designing a language model that uses a retrieval-based component for its attention mechanism. They observe that under a specific configuration, this retrieval-based model behaves identically to a sparse attention model that only considers previous tokens within the same input sequence. Which of the following configurations of the retrieval component's datastore would cause this functional equivalence?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science