Notation in a BERT-based Encoder-Decoder Architecture
In a BERT-based encoder-decoder architecture, specific mathematical notation is used to represent the sequences involved in the process. The source text is denoted as a sequence of tokens, , and its corresponding sequence of embeddings is represented as . On the generation side, the target text sequence generated by the decoder is denoted as , with its corresponding embedding sequence represented as .
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.1 Pre-training - Foundations of Large Language Models
Related
Role of the Adapter in BERT-based Encoder-Decoder Models
Notation in a BERT-based Encoder-Decoder Architecture
BERT-based Encoder-Decoder for Neural Machine Translation
A developer is explaining the process of generating a target text sequence using an architecture composed of a pre-trained encoder and a separate decoder. Analyze the following statements from their explanation. Which statement incorrectly describes the relationship between the encoder's output and the decoder's input during the generation process?
A sequence-to-sequence model uses a pre-trained text model as its encoder and a separate model as its decoder. Arrange the following steps to accurately represent the data flow from the initial source text to the final generated target text.
Diagnosing an Encoder-Decoder Model Failure
Learn After
A sequence-to-sequence model is translating the source text 'The cat sat'. The model has already generated the first target token, 'Le'. To generate the second target token, what sequence of embeddings is fed into the decoder? Assume the decoder's input starts with a special token
<s>, the source text embeddings are denoted byex, and the target text embeddings are denoted byey.In a sequence-to-sequence model that uses a pre-trained text encoder, specific symbols are used to represent different parts of the data flow. Match each notational symbol with its correct description.
Encoder Input Notation