1Cademy - Analyzing Data Samples for Instruction-Following

Learn Before

Definition of SFT Datasets

Short Answer

Analyzing Data Samples for Instruction-Following

You are reviewing a dataset intended for training a language model to follow instructions through supervised fine-tuning (SFT). Below are three data samples. Identify which sample (A, B, or C) is NOT structured as a standard input-output pair for this purpose, and briefly explain why it is unsuitable in that form.

Sample A:

Input: "Instruction: Summarize the following text in one sentence. Text: The sun is a star at the center of the Solar System. It is a nearly perfect sphere of hot plasma, with internal convective motion that generates a magnetic field via a dynamo process."
Output: "The sun is a plasma star at the center of our solar system that generates a magnetic field."

Sample B:

Input: "Instruction: Write a short poem about the ocean."
Output: A pair of responses is provided for comparison: Response 1 ('The waves crash high, a salty sigh, beneath the endless, azure sky.') is marked as 'better' than Response 2 ('Water is blue and very deep.').

Sample C:

Input: "Instruction: Translate 'hello world' to French."
Output: "Bonjour le monde"
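
As a study aid, the idea of a "standard input-output pair" can be sketched as a small structural check. This is only a minimal illustration under stated assumptions: the field names "input" and "output", the function name is_sft_pair, and the example records are hypothetical and are not taken from any particular dataset format or from the samples above.

```python
from typing import Any


def is_sft_pair(record: Any) -> bool:
    """Return True if `record` looks like a standard input-output pair.

    Assumption (hypothetical schema): a record is a dict holding one
    non-empty string under "input" and one non-empty string under "output".
    """
    if not isinstance(record, dict):
        return False
    inp = record.get("input")
    out = record.get("output")
    # Both fields must be present, be plain strings, and contain text.
    return (
        isinstance(inp, str) and bool(inp.strip())
        and isinstance(out, str) and bool(out.strip())
    )


# Hypothetical records used only to exercise the check.
example = {"input": "Instruction: Name the capital of France.", "output": "Paris"}
missing_output = {"input": "Instruction: Name the capital of France."}

print(is_sft_pair(example))         # True
print(is_sft_pair(missing_output))  # False
```

A check like this only inspects structure; it says nothing about whether the target output is correct or well written.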

Updated 2025-10-08
