Multiple Choice

When training a model on a document with multiple sentences, what is the primary advantage of corrupting the input by randomly shuffling the order of entire sentences, as opposed to simply reordering individual tokens across the entire document?

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science