1Cademy - When preparing text data to train a language model, various corruption techniques are used to alter the original input, which the model then learns to restore. Some of these techniques operate on the word or token level, while others operate on the sentence level. Match each corruption technique described below with the structural requirement of the input text.

Learn Before

Corruption Methods for Multi-Sentence Sequences

Matching

When preparing text data to train a language model, various 'corruption' techniques are used to alter the original input, which the model then learns to restore. Some of these techniques operate on the word or token level, while others operate on the sentence level. Match each corruption technique described below with the structural requirement of the input text.

Updated 2025-10-02

Contributors are:

Who are from:

Learn Before

Related