Definition

Input Representation in a Transformer Layer

The input to a Transformer layer at depth $l$ is represented as a sequence of hidden states, denoted $\mathbf{H}^l = \mathbf{h}_0^l \mathbf{h}_1^l \dots \mathbf{h}_m^l$. In this notation, $\mathbf{H}^l$ is the full sequence of hidden states for the layer, and $\mathbf{h}_i^l$ is the hidden state at position $i$ for $i = 0, \dots, m$, so the sequence contains $m + 1$ states.
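As a rough illustration of the notation, the sequence $\mathbf{H}^l$ can be sketched as a list of $m + 1$ vectors, one per position. The hidden size `d`, the helper name `make_hidden_states`, and the random values are assumptions made for this example only; the definition above does not specify them.

```python
import random


def make_hidden_states(m, d, seed=0):
    """Build a stand-in for H^l: a list of m + 1 hidden-state vectors.

    Each vector has length d (a hypothetical hidden size). The values are
    random placeholders, not the output of a real Transformer layer.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(m + 1)]


m, d = 4, 8                      # hypothetical sizes: positions 0..4, width 8
H_l = make_hidden_states(m, d)   # H^l = h_0^l h_1^l ... h_m^l

h_0 = H_l[0]                     # h_0^l, the hidden state at position 0
h_m = H_l[m]                     # h_m^l, the hidden state at the last position

print(len(H_l), len(h_0))        # number of positions, hidden size
```

Indexing `H_l[i]` corresponds to $\mathbf{h}_i^l$, and the list length is $m + 1$ because positions start at 0.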

Updated 2026-05-02

Tags

Ch.3 Prompting - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences
