1Cademy - Role of the Adapter in BERT-based Encoder-Decoder Models

Learn Before

Architecture of a BERT-based Encoder-Decoder Model

Concept

Role of the Adapter in BERT-based Encoder-Decoder Models

In a BERT-based encoder-decoder architecture, an adapter is an optional layer that serves as a bridge between the encoder and the decoder. Its primary function is to map the output representations generated by the BERT encoder into a format that is more suitable for the decoder to process. This helps align the output space of the pre-trained encoder with the input requirements of the decoder.

Updated 2026-04-18

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

An engineer is constructing a text summarization model by using a large, pre-trained language model as the encoder and a separate, newly initialized transformer as the decoder. The engineer observes that the model struggles to learn effectively. They hypothesize that the rich, complex output vectors from the pre-trained encoder are not in a format that the new decoder can easily interpret. Which of the following strategies directly addresses this specific problem by creating a bridge between the
Analyzing the Utility of an Adapter Layer
Diagnosing a Sequence-to-Sequence Model Failure

Learn Before

Related

Learn After