Formula

Transformer Layer Output Formula

The output of a Transformer layer at depth $l$, denoted $\mathbf{H}^{l+1}$, is computed by applying the layer's transformation function to its input, $\mathbf{H}^{l}$. This relationship can be expressed by the formula: $\mathbf{H}^{l+1} = \mathrm{Layer}(\mathbf{H}^{l})$.
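The recurrence can be illustrated with a minimal Python sketch that feeds each layer's output into the next layer. The names `toy_layer` and `run_stack` are hypothetical, and the toy layer is only a stand-in (a residual-style scaling) rather than a real Transformer layer with attention and feed-forward sublayers; the point is the iteration from one depth to the next.

```python
from typing import Callable, List

Matrix = List[List[float]]  # one row per token, one column per hidden dimension
LayerFn = Callable[[Matrix], Matrix]


def toy_layer(scale: float) -> LayerFn:
    """Build a hypothetical stand-in layer: H -> H + scale * H (residual form)."""
    def layer(h: Matrix) -> Matrix:
        return [[x + scale * x for x in row] for row in h]
    return layer


def run_stack(h0: Matrix, layers: List[LayerFn]) -> List[Matrix]:
    """Apply H^{l+1} = Layer(H^l) layer by layer; return [H^0, H^1, ..., H^L]."""
    states = [h0]
    for layer in layers:
        # The input to the layer at depth l is the output of the layer below it.
        states.append(layer(states[-1]))
    return states


h0 = [[1.0, 2.0], [3.0, 4.0]]  # assumed example input: 2 tokens, hidden size 2
states = run_stack(h0, [toy_layer(1.0), toy_layer(1.0)])
```

Here `states[l]` plays the role of $\mathbf{H}^{l}$, so the final element is the output of the whole stack. Swapping `toy_layer` for a real layer function leaves `run_stack` unchanged, since the formula only requires that each layer map a hidden-state matrix to another of the same shape.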


Updated 2026-04-30


Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences