Learn Before
Transformer Layer Output Formula
The output of a Transformer layer at depth l, denoted as H^l, is computed by applying the layer's transformation function, Layer_l, to its input, H^{l-1}. This relationship can be expressed by the formula: H^l = Layer_l(H^{l-1}).
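The recurrence H^l = Layer_l(H^{l-1}) can be sketched in code. This is a minimal illustration, not a real Transformer block: each "layer" below is a stand-in transformation (a linear map plus a nonlinearity), and the dimensions (5 tokens, hidden size 8, 3 layers) are assumed for the example. The point is that each layer maps a matrix of per-token vectors to a same-shaped matrix, which then becomes the next layer's input.

```python
import numpy as np

rng = np.random.default_rng(0)
num_tokens = 5   # one vector per token (assumed)
d_model = 8      # hidden size (assumed)
num_layers = 3   # depth (assumed)

def make_layer(rng, d_model):
    # Stand-in for Layer_l: any function mapping an (num_tokens, d_model)
    # matrix to a matrix of the same shape.
    W = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
    return lambda H: np.tanh(H @ W)

layers = [make_layer(rng, d_model) for _ in range(num_layers)]

H = rng.standard_normal((num_tokens, d_model))  # H^0: initial input matrix
for layer in layers:
    H = layer(H)  # H^l = Layer_l(H^{l-1}), applied sequentially

# The sequence structure (one vector per token) is preserved at every depth.
print(H.shape)
```

Note that the loop composes the layers, so after three iterations H holds Layer_3(Layer_2(Layer_1(H^0))), while the matrix shape (num_tokens, d_model) never changes.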

Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Transformer Layer Output Formula
General Formula for a Transformer Layer
Input Composition in a Prefix-Tuned Transformer Layer
A language model is processing an input sentence that has been broken down into 5 distinct tokens. The input to the first processing layer is represented as a matrix containing 5 separate vectors, one for each token. Why is it fundamentally important for the model to maintain this structure—a sequence of individual vectors—as the input to each subsequent layer, rather than, for example, averaging or concatenating them into a single vector?
Structure of a Transformer Layer's Input
When a Transformer model processes a sentence with 12 tokens, the input to the fifth layer is a single, high-dimensional vector that represents the aggregated meaning of the entire sentence as computed by the first four layers.
Learn After
An initial input matrix, denoted as H^0, is processed sequentially through three computational layers: Layer_1, Layer_2, and Layer_3. Which expression correctly calculates the output matrix of Layer_3?
A computational model processes an initial input matrix, H^0, through three sequential layers. Arrange the following hidden state matrices in the order they are generated.
In a multi-layer computational model, the output of the fifth layer is a matrix of hidden states denoted as H^5. This matrix serves as the input to the sixth layer, which has a transformation function represented as Layer_6. The output of this sixth layer, H^6, is calculated by the formula: H^6 = \text{____}.