True/False

In an autoregressive language model generating a sequence of text, the matrix containing 'key' vectors for previously generated tokens is updated at each step. Consider a scenario where this matrix has been populated with vectors from the first 10 tokens. When the 11th token is processed and its corresponding key vector is generated, the update procedure involves replacing the key vector of the very first token with the new one to keep the matrix size constant.

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science