Multiple Choice

In a self-attention mechanism where a query at a given position i can only interact with keys at positions j where j <= i, how many total query-key dot product computations are performed for an input sequence of length 5 (indexed 0 to 4)?

0

1

Updated 2025-09-28

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science