Learn Before
Analyzing Data Samples for Instruction-Following
You are reviewing a dataset intended for training a language model to follow instructions. Below are three data samples. Identify which sample (A, B, or C) is NOT structured as a standard input-output pair for this purpose and briefly explain why it is unsuitable.
Sample A:
- Input: "Instruction: Summarize the following text in one sentence. Text: The sun is a star at the center of the Solar System. It is a nearly perfect sphere of hot plasma, with internal convective motion that generates a magnetic field via a dynamo process."
- Output: "The sun is a plasma star at the center of our solar system that generates a magnetic field."
Sample B:
- Input: "Instruction: Write a short poem about the ocean."
- Output: A pair of responses is provided for comparison: Response 1 ('The waves crash high, a salty sigh, beneath the endless, azure sky.') is marked as 'better' than Response 2 ('Water is blue and very deep.').
Sample C:
- Input: "Instruction: Translate 'hello world' to French."
- Output: "Bonjour le monde"
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Example of SFT Dataset Samples
Key Attributes of Effective SFT Datasets and Their Impact on LLM Performance
Input and Output Sequences in SFT
A team is preparing a dataset to fine-tune a pre-trained language model to follow specific instructions. Which of the following data entries best exemplifies the fundamental structure of a single sample in a Supervised Fine-Tuning (SFT) dataset?
Evaluating a Potential Fine-Tuning Dataset
Characteristics of SFT Datasets
Analyzing Data Samples for Instruction-Following