1Cademy - Deconstructing an SFT Training Sample

Learn Before

Input and Output Sequences in SFT

Case Study

Deconstructing an SFT Training Sample

Analyze the following data sample prepared for Supervised Fine-Tuning. Identify the complete 'input sequence' and the 'output sequence' that the model is being trained on. Explain your reasoning for this division.

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Ch.2 Generative Models - Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Dataset Composition for RL Fine-Tuning in RLHF
A machine learning engineer is creating a dataset to fine-tune a language model to act as a helpful assistant. The goal is to teach the model to follow instructions and provide complete, high-quality answers. Which of the following examples represents the most effective input-output pair for this supervised fine-tuning task?
Structuring a Sample from Input and Output Segments
Deconstructing an SFT Training Sample
Constructing an SFT Training Pair for Text Summarization

Learn Before

Related