1Cademy - Sampling in Self-Instruct

Learn Before

Initialization of the Task Pool in Self-Instruct

Activity (Process)

Sampling in Self-Instruct

In each iteration of the Self-Instruct cycle, a small number of existing instructions are sampled from the task pool. The sampled instructions are placed in the prompt as in-context examples, and the large language model is asked to generate a new, related instruction from them.
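The sampling step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the pool contents, the function names `sample_instructions` and `build_prompt`, the prompt wording, and the choice of uniform random sampling without replacement are all hypothetical and are not taken from the source.

```python
import random


def sample_instructions(task_pool, k=3, seed=None):
    """Draw k distinct instructions from the pool (assumed: uniform, without replacement)."""
    rng = random.Random(seed)
    return rng.sample(task_pool, k)


def build_prompt(examples):
    """Format the sampled instructions as numbered in-context examples.

    The final, empty slot is left for the model to complete with a new instruction.
    """
    lines = ["Come up with a new task instruction."]  # hypothetical prompt header
    for i, instruction in enumerate(examples, start=1):
        lines.append(f"Task {i}: {instruction}")
    lines.append(f"Task {len(examples) + 1}:")
    return "\n".join(lines)


# Hypothetical task pool, for illustration only.
task_pool = [
    "Sort a list of integers in ascending order.",
    "Translate the sentence into French.",
    "Summarize the paragraph in one sentence.",
    "Classify the review as positive or negative.",
    "Write a haiku about autumn.",
]

examples = sample_instructions(task_pool, k=3, seed=0)
prompt = build_prompt(examples)
print(prompt)
```

The resulting prompt would then be sent to the language model, and its completion of the last slot becomes the candidate new instruction. Because the examples shape what the model produces, a sample of near-identical instructions is likely to yield yet another close variant, which is why the diversity of the sample matters.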

Updated 2026-05-01

References

Reference of Foundations of Large Language Models Course

Learn After

Instruction Generation in Self-Instruct
A team is developing a system to automatically generate new instructional tasks for a large language model. The system works by first selecting a few existing tasks from a large pool to serve as examples. In one run, the system selects three examples that are all variations of the same task: 'Sort a list of integers in ascending order.' What is the most probable outcome when these highly similar examples are used to prompt the model to generate a new instruction?
Critique of a Sampling Strategy for Instruction Generation
Diagnosing a Flaw in an Instruction Generation Pipeline
