1Cademy - Next-Layer Input Composition Formula in Prefix-Tuning

Learn Before

Formula

Next-Layer Input Composition Formula in Prefix-Tuning

In a prefix-tuned model, the complete hidden state for layer $l+1$ , denoted as $\mathbf{H}^{l+1}$ , is formed by concatenating the layer-specific trainable prefix vectors with the processed hidden states of the original input sequence. This composition is represented by the formula: $\mathbf{H}^{l+1} = \mathbf{p}_0^{l+1} \mathbf{p}_1^{l+1} \dots \mathbf{p}_n^{l+1} \overline{\mathbf{H}}^{l+1}$ where $\overline{\mathbf{H}}^{l+1}$ is the sequence of output hidden states corresponding to the original input, which can be further expanded as: $\mathbf{H}^{l+1} = \mathbf{p}_0^{l+1} \mathbf{p}_1^{l+1} \dots \mathbf{p}_n^{l+1} \mathbf{h}_0^{l+1} \mathbf{h}_1^{l+1} \dots \mathbf{h}_m^{l+1}$

0

1

Updated 2026-06-16

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course

Learn Before

Related

Learn After