BERT (Bidirectional Encoder Representations from Transformers)
BERT (Bidirectional Encoder Representations from Transformers) is one of the most widely used pre-trained sequence encoding models in Natural Language Processing. As a foundation model, it is trained with a self-supervised approach that combines two tasks: masked language modeling (MLM) and next sentence prediction (NSP). In MLM, the model predicts randomly masked words from their surrounding context, which lets it learn deep bidirectional language representations. In NSP, the model predicts whether two sentences appear consecutively in the original text, encouraging it to capture relationships between sentences. This dual-task pre-training makes BERT a versatile foundation model that can be fine-tuned for a wide array of NLP applications.
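To make the MLM objective concrete, the sketch below (assuming the Hugging Face transformers library and the publicly available bert-base-uncased checkpoint) masks a single word and asks a pre-trained BERT to recover it from the context on both sides of the mask.

import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

# Load a pre-trained BERT checkpoint and its tokenizer.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

# Hide one word and let BERT predict it from the words on both sides.
text = "The quick brown [MASK] jumps over the lazy dog."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Find the masked position and take the highest-scoring vocabulary token.
mask_positions = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
predicted_ids = logits[0, mask_positions].argmax(dim=-1)
print(tokenizer.decode(predicted_ids))  # expected to print something like "fox"

Because the prediction at the masked position can attend to tokens on both its left and right, this objective is what gives BERT its bidirectional representations, in contrast to left-to-right (causal) language models.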
References
Transfer Learning Reference
Attention Is All You Need
Reference of Foundations of Large Language Models Course
Tags
Data Science
Foundations of Large Language Models Course
Computing Sciences
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Related
Word2Vec
GloVe
ELMo
Latent Semantic Analysis (LSA) - Reference
Latent Semantic Analysis
Noise Contrastive Estimation
Ranking Loss
Ensemble Learning
BERT (Bidirectional Encoder Representations from Transformers)
Large Language Models (LLMs)
Bengio et al. (2003) Feed-Forward Neural Language Model
A team is developing a language model to predict the next word in a sentence. They find that their model assigns a probability of zero to the phrase 'the innovative chef prepares...' because it has never seen the specific two-word sequence 'innovative chef' in its training data, despite having seen 'innovative ideas' and 'master chef' many times. Which characteristic of a neural network-based approach to language modeling is specifically designed to overcome this type of generalization failure?
NLM Advantage Over Traditional Models
Language Model Generalization
BERT
BART
T5
RoBERTa
GPT Series
LLaMA2
DeepSeek-V3
Falcon
Mistral
PaLM-540B
Gemma-7B
Gemma2
A software development team is tasked with building a feature that can automatically generate a concise, one-paragraph summary from a long news article. The system needs to first comprehend the full context of the source article and then generate a new, coherent summary. Based on the typical strengths of different foundational model designs, which of the following models would be the most suitable choice for this specific task?
Match each pre-trained model with the description that best fits its architectural design and primary use case.
Evaluating Model Architecture Selection for a Classification Task
Architectural Differences Between Sequence Encoding and Generation Models
Fine-tuning for Sequence Encoding Models
Role of Encoders as Components in NLP Systems
Input and Output of a Sequence Encoder
Causal Attention Mechanism
Pre-train and Fine-tune Paradigm for Encoder Models
An engineer is building a system to automatically categorize customer reviews as 'positive' or 'negative'. The first component of their system must read the raw text of a review and convert it into a single, fixed-size numerical vector that captures the overall sentiment and meaning. This vector will then be fed into a separate classification component. Which of the following best describes the function of this first component?
A company develops a sophisticated model that takes a user's question as input and produces a detailed numerical representation that captures the question's full meaning. This model, by itself, is sufficient to function as a complete question-answering system.
The Role of Sequence Encoding in Text-Based Prediction
Learn After
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
What is BERT?
BERT's Core Architecture
Embedding Size in Transformer Models
BERT Model Sizes and Hyperparameters
Strategies for Improving BERT: Model Scaling
Approaches to Extending BERT for Multilingual Support
Using BERT as an Encoder in Sequence-to-Sequence Models
Considerations in BERT Model Development
Analysis of Bidirectional Context in Language Models
A language model is pre-trained using a method where it is given a sentence with a randomly hidden word, for example: 'The quick brown [HIDDEN] jumps over the lazy dog.' The model's goal is to predict the hidden word by examining all the other visible words in the sentence. What is the primary advantage of this specific training approach for understanding language?
Evaluating Pre-training Task Relevance
Designing a Mobile-Deployable BERT Encoder Under Tight Memory and Latency Constraints
Choosing a BERT Compression Strategy for an On-Prem Document Triage System
Selecting a BERT Variant for a Regulated, On-Device Email Classification Feature
Right-Sizing a BERT Encoder for a Multilingual Support-Ticket Router Without Breaking the Memory Budget
Selecting an Efficient BERT Variant for a Domain-Specific Contract Clause Classifier
Compressing a BERT-Based Search Re-Ranker for Edge Deployment Without Losing Domain Coverage
Vocabulary Size in Transformers
BERT Output Adapter