Data Filtering for a Specialized Language Model
Based on the scenario provided, which data sample would you recommend excluding from the final training set for the large, specialized model? Justify your decision based on the provided scores and the underlying principle of this filtering technique.
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Data Filtering for a Specialized Language Model
A team is refining a large, general-purpose text corpus to train a specialized language model. They use a smaller, pre-existing model to calculate the cross-entropy for each document. Their goal is to create a high-quality, coherent, and well-structured training set. Which of the following filtering strategies should they implement and why?
Interpreting Cross-Entropy for Data Curation
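
The scoring step these questions rely on, a small model assigning a cross-entropy to each document, can be sketched as follows. This is a minimal illustration under stated assumptions, not the method used in the course: a toy unigram model with add-one smoothing stands in for the smaller pre-existing model, and the names train_unigram, cross_entropy, and rank_by_cross_entropy are hypothetical. The sketch only ranks documents by score; which end of the ranking to exclude depends on the scenario and the filtering principle being applied.

```python
import math
from collections import Counter


def train_unigram(reference_texts):
    """Fit a toy unigram model (assumed stand-in for the small scoring model)."""
    counts = Counter(
        tok for text in reference_texts for tok in text.lower().split()
    )
    return counts, sum(counts.values())


def cross_entropy(text, counts, total):
    """Average negative log2-probability per token, with add-one smoothing."""
    tokens = text.lower().split()
    if not tokens:
        return float("inf")  # an empty document cannot be scored meaningfully
    vocab_size = len(counts) + 1  # one extra slot shared by unseen tokens
    log_prob = sum(
        math.log2((counts[tok] + 1) / (total + vocab_size)) for tok in tokens
    )
    return -log_prob / len(tokens)


def rank_by_cross_entropy(documents, counts, total):
    """Return (score, document) pairs sorted from lowest to highest score."""
    return sorted((cross_entropy(doc, counts, total), doc) for doc in documents)


if __name__ == "__main__":
    # Hypothetical reference text and candidate documents, for illustration only.
    counts, total = train_unigram(
        ["the model reads the text", "the text is clean"]
    )
    candidates = ["the text is clean", "zxq vbn qwe rty"]
    for score, doc in rank_by_cross_entropy(candidates, counts, total):
        print(f"{score:.3f}  {doc}")
```

In practice the scorer would be a trained neural language model rather than a unigram counter, but the quantity is the same: the average number of bits the model needs per token of the document.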