A team is refining a large, general-purpose text corpus to train a specialized language model. They use a smaller, pre-existing model to calculate the cross-entropy for each document. Their goal is to create a high-quality, coherent, and well-structured training set. Which of the following filtering strategies should they implement and why?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Data Filtering for a Specialized Language Model
A team is refining a large, general-purpose text corpus to train a specialized language model. They use a smaller, pre-existing model to calculate the cross-entropy for each document. Their goal is to create a high-quality, coherent, and well-structured training set. Which of the following filtering strategies should they implement and why?
Interpreting Cross-Entropy for Data Curation