Learn Before
Concept

Transformer Block Inputs and Outputs Notation

In a Transformer-decoder architecture, the inputs to a Transformer block are denoted by the sequence z0,...,zm1\mathbf{z}_0,...,\mathbf{z}_{m-1}. After processing through the entire model, the outputs of the last Transformer block (the LL-th block) are denoted as h0L,...,hm1L\mathbf{h}_0^{L},...,\mathbf{h}_{m-1}^{L}.

Image 0

0

1

Updated 2026-04-19

Contributors are:

Who are from:

Tags

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences