Matching

When preparing text data to train a language model, various 'corruption' techniques are used to alter the original input, which the model then learns to restore. Some of these techniques operate at the word or token level, while others operate at the sentence level. Match each corruption technique described below with the structural requirement it places on the input text.
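To make the distinction between the two levels concrete, here is a minimal sketch in Python. The three techniques shown (token masking, token deletion, and sentence permutation) are assumed examples of commonly used corruptions and are not necessarily the ones listed in this question. The `MASK` symbol, the function names, and the probability defaults are all hypothetical choices made for illustration.

```python
import random

MASK = "[MASK]"  # hypothetical placeholder symbol, chosen for illustration


def mask_tokens(tokens, mask_prob=0.15, rng=None):
    """Token-level: replace each token with MASK with probability mask_prob."""
    rng = rng or random.Random(0)
    return [MASK if rng.random() < mask_prob else tok for tok in tokens]


def delete_tokens(tokens, delete_prob=0.15, rng=None):
    """Token-level: drop each token with probability delete_prob."""
    rng = rng or random.Random(0)
    return [tok for tok in tokens if rng.random() >= delete_prob]


def permute_sentences(sentences, rng=None):
    """Sentence-level: shuffle sentence order; needs sentence boundaries."""
    rng = rng or random.Random(0)
    shuffled = list(sentences)  # copy so the caller's list is not modified
    rng.shuffle(shuffled)
    return shuffled


tokens = "the cat sat on the mat".split()
sentences = ["The cat sat.", "It purred.", "Then it slept."]

print(mask_tokens(tokens, mask_prob=0.3))
print(delete_tokens(tokens, delete_prob=0.3))
print(permute_sentences(sentences))
```

In this sketch the two token-level functions accept any flat sequence of tokens, whereas `permute_sentences` only makes sense when the input has already been split into more than one sentence. That difference in what the input must look like is the kind of structural requirement the question asks about.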

Updated 2025-10-02

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy
