Activity (Process)

Filtering in Self-Instruct

In the Self-Instruct framework, newly generated samples are evaluated using heuristic rules before being accepted. A key heuristic involves filtering out samples or instructions that are too similar to those already present in the task pool. Samples that successfully pass this examination are then added to the pool, ensuring the dataset's quality and novelty.

Image 0

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related