Definition

Notation in a BERT-based Encoder-Decoder Architecture

In a BERT-based encoder-decoder architecture, specific mathematical notation is used to represent the sequences involved in the process. The source text is denoted as a sequence of tokens $x_1, \dots, x_m$, and its corresponding sequence of embeddings is represented as $\mathbf{e}_{1}^{x}, \dots, \mathbf{e}_{m}^{x}$. On the generation side, the target text sequence generated by the decoder is denoted as $y_1, \dots, y_n$, with its corresponding embedding sequence represented as $\mathbf{e}_{1}^{y}, \dots, \mathbf{e}_{n}^{y}$.
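This mapping from token ids to embedding vectors can be sketched as a simple table lookup. The sketch below is illustrative only: the vocabulary size, model dimension, token ids, and the use of a single shared embedding table are all assumptions, not details from the source.

```python
import numpy as np

# Hypothetical sizes (assumptions for illustration, not from the source).
vocab_size, d_model = 10, 4
rng = np.random.default_rng(0)

# A single embedding table; a real model learns these weights during training.
embedding_table = rng.standard_normal((vocab_size, d_model))

# Source token ids x_1 ... x_m (here m = 3) and target ids y_1 ... y_n (n = 2).
x = np.array([2, 5, 7])
y = np.array([1, 3])

# Row lookup yields the embedding sequences e_1^x ... e_m^x and e_1^y ... e_n^y.
e_x = embedding_table[x]  # shape (m, d_model) = (3, 4)
e_y = embedding_table[y]  # shape (n, d_model) = (2, 4)

print(e_x.shape, e_y.shape)
```

Each row of `e_x` is one embedding vector $\mathbf{e}_{i}^{x}$, so the source sequence of length $m$ becomes an $m \times d$ matrix fed to the encoder, and likewise for the target side.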

Updated 2026-04-18
