An engineer is preparing text data for a model. The process involves taking an original sentence and creating a 'damaged' version by altering some of its words. The engineer observes that the damaged sentences are consistently shorter in length (fewer total words) than their original counterparts. Which of the following data alteration methods is the engineer most likely using?
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An engineer is preparing text data for a model. The process involves taking an original sentence and creating a 'damaged' version by altering some of its words. The engineer observes that the damaged sentences are consistently shorter in length (fewer total words) than their original counterparts. Which of the following data alteration methods is the engineer most likely using?
Choosing a Data Corruption Strategy
Identifying Text Corruption Methods