Learn Before
Short Answer

Analyzing Data Samples for Instruction-Following

You are reviewing a dataset intended for training a language model to follow instructions. Below are three data samples. Identify which sample (A, B, or C) is NOT structured as a standard input-output pair for this purpose and briefly explain why it is unsuitable.

Sample A:

  • Input: "Instruction: Summarize the following text in one sentence. Text: The sun is a star at the center of the Solar System. It is a nearly perfect sphere of hot plasma, with internal convective motion that generates a magnetic field via a dynamo process."
  • Output: "The sun is a plasma star at the center of our solar system that generates a magnetic field."

Sample B:

  • Input: "Instruction: Write a short poem about the ocean."
  • Output: A pair of responses is provided for comparison: Response 1 ('The waves crash high, a salty sigh, beneath the endless, azure sky.') is marked as 'better' than Response 2 ('Water is blue and very deep.').

Sample C:

  • Input: "Instruction: Translate 'hello world' to French."
  • Output: "Bonjour le monde"

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science