Composition of Hidden States in a Prefix-Tuned Layer
In a prefix-tuned model, the complete hidden state for layer l, denoted as H^l, is formed by concatenating the prefix vectors with the processed hidden states of the original input sequence. This composition is represented by the formula:

H^l = [P^l ; \overline{H}^l]

where \overline{H}^l is the sequence of output hidden states corresponding to the original input, which can be further expanded as:

\overline{H}^l = [\overline{h}^l_1 ; \overline{h}^l_2 ; ... ; \overline{h}^l_n]
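The concatenation above can be sketched in a few lines of Python. This is an illustrative toy (the names, sizes, and placeholder values are assumptions, not from the source), using the 20-prefix / 128-input lengths mentioned elsewhere on this page:

```python
# Hypothetical sketch: composing the full input H^l of a prefix-tuned layer.
m, n, d = 20, 128, 64                  # prefix length, input length, hidden size (assumed)

P = [[0.0] * d for _ in range(m)]      # trainable prefix vectors P^l (placeholder values)
H_bar = [[1.0] * d for _ in range(n)]  # hidden states \overline{H}^l for the original input

# H^l = [P^l ; \overline{H}^l]: prefix vectors first, then the input's hidden states
H_full = P + H_bar
assert len(H_full) == m + n            # combined sequence of 148 vectors
```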
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
A Transformer layer adapted for a specific fine-tuning method receives a combined input sequence. This input is created by prepending 20 trainable vectors to a sequence of 128 hidden states from the previous layer. After processing this combined sequence of 148 vectors, the layer produces a full set of 148 output hidden states. Which portion of this full output is selected to be passed on to the next layer in the network?
Calculating the Output Slice in Prefix-Tuning
Composition of Hidden States in a Prefix-Tuned Layer
Consider a prefix-tuned Transformer layer where the full input H^l is composed of prefix vectors followed by the original input's hidden states. The output passed to the subsequent layer, \overline{H}^{l+1}, is correctly obtained by applying the layer's transformation only to the hidden states corresponding to the original input, ignoring the prefix vectors during the computation.
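The output-slice selection asked about above (20 prefix vectors plus 128 input states, 148 outputs) can be sketched as follows. This is an assumed illustration, not code from the source; the layer output is a placeholder:

```python
m, n, d = 20, 128, 64                        # prefix length, input length, hidden size (assumed)

# Hypothetical layer output over the full (m + n)-position combined sequence
full_output = [[float(i)] * d for i in range(m + n)]

# Only the last n positions (those aligned with the original input) are kept
# as \overline{H}^{l+1}; the m prefix positions are discarded.
H_bar_next = full_output[m:]
assert len(H_bar_next) == n                  # 128 states passed to the next layer
assert H_bar_next[0][0] == float(m)          # first kept vector sits at position m
```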
Learn After
In a specific parameter-efficient tuning method, each layer of a transformer is adapted by prepending a sequence of new, trainable vectors to the sequence of hidden states from the previous layer. Suppose for a given layer, the sequence of these new trainable vectors has a length of 20, and the sequence of hidden states corresponding to the original text input has a length of 128. After this layer processes the combined sequence, a new set of hidden states is generated. How is the complete hidden state sequence for the next layer constructed?
Analyzing an Incorrect Hidden State Composition
Constructing the Input Hidden State for a Prefix-Tuned Layer