Learn Before
Definition

Input and Output of a Sequence Encoder

An encoder, denoted as Encoderθ()\mathrm{Encoder}_{\theta}(\cdot), is a function parameterized by θ\theta that processes an input sequence of tokens, x=x0...xm\mathbf{x}=x_0...x_m, to produce a corresponding sequence of output vectors, H=h0...hm\mathbf{H}=\mathbf{h}_0...\mathbf{h}_m. If each hi\mathbf{h}_i is viewed as a row vector, the entire output sequence can be represented as the matrix H=[h0hm]\mathbf{H} = \begin{bmatrix} \mathbf{h}_0 \\ \vdots \\ \mathbf{h}_m \end{bmatrix}.

0

1

Updated 2026-04-15

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences