Short Answer

Correcting a Misconception in Vector Generation

A fellow student claims that to generate the key vector for a specific token in an input sequence, you must use a different token from that sequence as the input. Identify the fundamental misunderstanding in this claim and explain the correct procedure for generating the query, key, and value vectors for that single token.
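One way to examine the claim is a minimal sketch in plain Python, assuming a single attention head with three learned projection matrices. The names `W_q`, `W_k`, `W_v`, `X`, and `qkv_for_token` are hypothetical, and the random values stand in for trained weights and real embeddings. In this sketch the query, key, and value vectors for position i are all computed from that position's own embedding; other tokens only come into play later, when queries are compared against keys.

```python
import random

def matvec(vec, mat):
    """Multiply a row vector (length d_in) by a matrix (d_in x d_out)."""
    return [sum(v * row[j] for v, row in zip(vec, mat))
            for j in range(len(mat[0]))]

def random_matrix(rows, cols, rng):
    """Build a rows x cols matrix of values in [-1, 1] (placeholder data)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
d_model, d_head = 4, 3  # illustrative sizes, not taken from the source

# Hypothetical projection matrices, shared by every position in the sequence.
W_q = random_matrix(d_model, d_head, rng)
W_k = random_matrix(d_model, d_head, rng)
W_v = random_matrix(d_model, d_head, rng)

# Hypothetical embeddings for a three-token input sequence (one row per token).
X = random_matrix(3, d_model, rng)

def qkv_for_token(i):
    """Return (query, key, value) for token i, all derived from X[i] alone."""
    x_i = X[i]
    return matvec(x_i, W_q), matvec(x_i, W_k), matvec(x_i, W_v)

q1, k1, v1 = qkv_for_token(1)
```

Replacing the embedding of a different token, for example `X[0]`, and calling `qkv_for_token(1)` again would leave `k1` unchanged under these assumptions, which is the point the question is probing.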

Updated 2025-10-02

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science