Short Answer

Analysis of Input Corruption Impact

Consider two different methods for corrupting an input sentence for a language model's training. Method 1 replaces certain words with a generic placeholder symbol, keeping the sentence length the same. Method 2 completely removes certain words, resulting in a shorter sentence. Analyze the unique challenge that Method 2 presents to a model in learning the grammatical structure of a language, compared to Method 1.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science