Learn Before
Deconstructing an SFT Training Sample
Analyze the following data sample prepared for Supervised Fine-Tuning. Identify the complete 'input sequence' and the 'output sequence' that the model is being trained on. Explain your reasoning for this division.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.2 Generative Models - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Dataset Composition for RL Fine-Tuning in RLHF
A machine learning engineer is creating a dataset to fine-tune a language model to act as a helpful assistant. The goal is to teach the model to follow instructions and provide complete, high-quality answers. Which of the following examples represents the most effective input-output pair for this supervised fine-tuning task?
Structuring a Sample from Input and Output Segments
Deconstructing an SFT Training Sample
Constructing an SFT Training Pair for Text Summarization