Activity (Process)

Data Selection and Filtering Using Weak Models

A method for curating training data for large language models involves leveraging a smaller, weaker model. This process entails calculating metrics such as likelihood or cross-entropy for each data sequence using the weak model. These metrics then serve as a basis for selection criteria, allowing for the filtering of less suitable data or the prioritization of high-quality data during pre-training or fine-tuning.

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences