Multiple Choice

In a transformer model generating text, a matrix of 'key' vectors is maintained for all previously generated tokens. Suppose at a certain step, this matrix K contains vectors for two previous tokens and is represented as:

K = [[0.1, 0.5], [0.9, 0.2]]

The model then processes a new token and generates a corresponding new key vector k_new:

k_new = [0.4, 0.8]

Based on the standard procedure for expanding this matrix during text generation, what will the updated matrix K be after incorporating k_new?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science