Learn Before
BART Model's Corruption Methods for Multi-Sentence Sequences
The BART (Bidirectional and Auto-Regressive Transformers) model employs two corruption strategies designed specifically for sequences containing multiple sentences: sentence permutation, which shuffles the sentences of a document into a random order, and document rotation, which rotates the document so that it begins at a randomly chosen token.
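A minimal Python sketch of the two strategies, assuming whitespace tokenization and splitting sentences on full stops (the same heuristic described in the BART paper); the function names are illustrative, not part of any library:

```python
import random

def sentence_permutation(text: str) -> str:
    # Split the document into sentences on full stops and shuffle
    # them into a random order (BART's "sentence permutation").
    sentences = [s.strip() for s in text.split(".") if s.strip()]
    random.shuffle(sentences)
    return ". ".join(sentences) + "."

def document_rotation(text: str) -> str:
    # Pick a token uniformly at random and rotate the document so
    # that it begins with that token (BART's "document rotation").
    tokens = text.split()
    start = random.randrange(len(tokens))
    return " ".join(tokens[start:] + tokens[:start])

doc = "The team celebrated their victory. They had trained hard for months."
print(sentence_permutation(doc))
# e.g. "They had trained hard for months. The team celebrated their victory."
print(document_rotation(doc))
# e.g. "hard for months. The team celebrated their victory. They had trained"
```

In both cases the model is trained to reconstruct the original document from the corrupted version.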
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
BART Model's Corruption Methods for Multi-Sentence Sequences
When pre-training a model on a document, a common strategy is to intentionally alter the input text and task the model with restoring the original. Which of the following alteration techniques is uniquely dependent on the input text containing more than one sentence?
When preparing text data to train a language model, various 'corruption' techniques are used to alter the original input, which the model then learns to restore. Some of these techniques operate on the word or token level, while others operate on the sentence level. Match each corruption technique described below with the structural requirement of the input text.
Analyzing Text Corruption Strategies
Learn After
Sentence Reordering as an Input Corruption Method
A pre-training process is applied to the following two-sentence input: 'The team celebrated their victory. They had trained hard for months.' The input is transformed into: 'The team [MASK] hard for months.' The model is then tasked with reconstructing the original, complete text from this corrupted input. Which specific data corruption technique, designed for handling sequences of text, does this process exemplify? (A sketch of the transformation follows this list.)
Choosing a Pre-training Strategy
A model is pre-trained using corruption techniques that operate on the structure of multi-sentence documents. Match each technique with its correct description.
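The transformation quoted in the card above is text infilling: one contiguous span, here crossing a sentence boundary, is replaced by a single [MASK] token. A minimal sketch, assuming whitespace tokenization; the span position and length are fixed here to reproduce the card's example, whereas BART samples the position randomly and draws span lengths from a Poisson distribution (lambda = 3):

```python
def text_infilling(tokens: list[str], start: int, span_len: int) -> list[str]:
    # Replace the contiguous span tokens[start:start + span_len]
    # with a single [MASK] token; the model must regenerate the
    # full span, including any sentence boundary inside it.
    return tokens[:start] + ["[MASK]"] + tokens[start + span_len:]

tokens = "The team celebrated their victory. They had trained hard for months.".split()
# Mask the 6-token span "celebrated their victory. They had trained",
# which crosses the boundary between the two sentences. In practice
# start and span_len would be sampled rather than fixed.
print(" ".join(text_infilling(tokens, start=2, span_len=6)))
# -> "The team [MASK] hard for months."
```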