Short Answer

Calculating a Linear Attention Output Vector

In a linear attention mechanism, at a specific timestep i, you are given the following:

  • Transformed query vector: q'_i = [2, 1]
  • Accumulated key-value state: μ_i = [[10, 5], [4, 8]]
  • Accumulated key state: ν_i = [6, 3]

Using the formula Output = (q'_i * μ_i) / (q'_i * ν_i), calculate the final output vector. Provide the calculated values for the numerator and the denominator before giving the final result.

0

1

Updated 2026-04-22

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science