Positional Context in Concatenated Sequences
Consider two token sequences, x and y. A language model computes representations in two separate scenarios: one by processing the concatenated sequence [x, y], and another by processing the sequence y on its own. Explain why the initial input vector for the first token of y differs between these two scenarios. Your explanation should identify the specific component of the input vector that changes and the reason for this change.
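The difference can be made concrete with a small sketch, assuming the common setup in which a token's input vector is the sum of its token embedding and an absolute positional embedding. All names and sizes below (`tok_emb`, `pos_emb`, the token ids) are hypothetical illustrations, not from any specific model:

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, d_model, max_len = 100, 8, 32
tok_emb = rng.normal(size=(vocab_size, d_model))  # one row per vocabulary token
pos_emb = rng.normal(size=(max_len, d_model))     # one row per absolute position

def input_vectors(token_ids, offset=0):
    # Input vector = token embedding + positional embedding at the token's
    # absolute position within the full sequence being processed.
    return np.array([tok_emb[t] + pos_emb[offset + i]
                     for i, t in enumerate(token_ids)])

x = [1, 2, 3]  # hypothetical token ids for sequence x
y = [7, 8]     # hypothetical token ids for sequence y

alone  = input_vectors(y)                 # y by itself: positions 0, 1
concat = input_vectors(y, offset=len(x))  # y inside [x, y]: positions 3, 4

# The token-embedding component of y's first token is identical in both cases;
# only the positional-embedding component differs (pos_emb[0] vs pos_emb[3]).
print(np.allclose(alone[0], concat[0]))
```

Under this assumption, the first token of y receives pos_emb[0] when y is processed alone but pos_emb[len(x)] inside [x, y], so the two input vectors differ even though the token embedding is unchanged.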
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is given two token sequences: sequence x with 10 tokens (at positions 0 through 9) and sequence y with 5 tokens. To process them together, they are concatenated into a single sequence [x, y]. How is the initial input vector for the very first token of the original sequence y calculated before being passed to the first processing layer?
Evaluating a Representation Generation Method