Short Answer

Calculating Pre-Softmax Attention Scores

An attention mechanism computes scores for a single query vector q' against a key matrix K using the formula scores = q' * K^T * sqrt(d), where d is the vector dimension. Given q' = [2, 1], K = [[1, 0], [3, 2]], and d = 4, calculate the resulting scores vector.

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science