Learn Before
Simultaneous Processing of Input Context Tokens
In the first stage of text generation, all tokens of the input context, denoted x, are fed to the model at once and processed in parallel. This single pass over the full prompt is what makes the prompt-processing phase computationally different from the subsequent token-by-token generation phase.
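The asymmetry between the two stages can be sketched with a toy example. The "model" below is a hypothetical stand-in (random logits, not a real transformer); what matters is the shape of the computation: one forward call over the whole prompt, then a loop that extends the sequence one token at a time.

```python
import numpy as np

# Toy illustration (NOT a real LLM): the "model" maps a token sequence to
# next-token logits for every position. All names here are hypothetical.
VOCAB_SIZE = 16

def forward(token_ids: np.ndarray) -> np.ndarray:
    """Return next-token logits at every position, shape [len, vocab].
    In a real transformer, this one call attends over all positions in parallel."""
    rng = np.random.default_rng(int(token_ids.sum()))  # deterministic toy logits
    return rng.standard_normal((len(token_ids), VOCAB_SIZE))

# Stage 1 (prompt processing): the entire input context x goes through
# ONE forward pass -- all prompt tokens are handled simultaneously.
x = np.array([3, 7, 2, 9])           # token ids of the prompt
prefill_logits = forward(x)          # logits for all 4 positions at once

# Stage 2 (generation): tokens are produced one at a time; each new token
# is appended to the sequence before the next forward pass.
generated = list(x)
for _ in range(3):
    logits = forward(np.array(generated))
    next_token = int(np.argmax(logits[-1]))  # greedy pick at the last position
    generated.append(next_token)

print(len(generated))  # prompt length 4 + 3 generated tokens
```

A real implementation would also cache the attention keys and values computed during the prompt pass so the generation loop does not recompute them, but the two-stage structure is the same.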
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Examples of text generation
Decoding Methods to Generate Continuations in TGM
Stochastic decoding methods in TGM
Simultaneous Processing of Input Context Tokens
Building the Encoded Representation of Input
A user gives a language model the input: "Ancient Rome was a civilization known for its". The model then produces the following output: "engineering marvels, such as aqueducts and roads." Based on the two-stage process of text generation, which statement best analyzes this interaction?
Arrange the following stages into the correct sequence that describes how a language model generates text based on an initial input.
Analyzing a Code Generation Scenario
Learn After
When a language model generates a response, it first processes the user's entire input prompt and then generates the output one token at a time. How does the computational approach for these two phases typically differ in terms of how tokens are handled?
When a language model is given an initial text prompt, it processes the tokens of that prompt one by one, in the order they appear, before it starts generating a response.
Processing Asymmetry in Text Generation