A development team is updating a pre-trained language model by further training it on a curated dataset of specific prompts and their desired, high-quality outputs (e.g., prompt: 'Explain gravity to a 5-year-old,' output: 'Gravity is like a big, invisible hug from the Earth...'). Why is this specific training process considered a method for model alignment?
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Classification of LLM Fine-Tuning Approaches for Reasoning Tasks
Evaluating the Purpose of Instruction-Based Training
The process of adapting a pre-trained language model using a dataset of instructions and their corresponding desired outputs is categorized as an alignment method not because it expands the model's core linguistic knowledge or raw predictive accuracy, but because it steers the model's behavior toward human intent. The base model was trained only to predict the next token; supervised fine-tuning on curated prompt-response pairs teaches it to follow instructions and produce the kinds of responses humans judge helpful and appropriate, bringing its outputs into line with human expectations rather than with whatever continuation is statistically most likely.
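The mechanics of this supervised fine-tuning step can be sketched in a few lines. The sketch below is a minimal toy illustration, not a production setup: `TinyLM`, the vocabulary size, and the token ids are all hypothetical stand-ins for a real pre-trained LLM and tokenizer. The key alignment-relevant detail is the label masking: prompt positions are set to `-100` so the loss rewards reproducing only the curated, desired response.

```python
# Minimal sketch of supervised instruction fine-tuning (SFT) on one
# (prompt, desired response) pair. TinyLM is a hypothetical stand-in
# for a real pre-trained language model.
import torch
import torch.nn as nn

torch.manual_seed(0)
VOCAB = 32

class TinyLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB, 16)
        self.head = nn.Linear(16, VOCAB)

    def forward(self, ids):
        return self.head(self.emb(ids))

# Toy token ids standing in for a tokenized instruction and response,
# e.g. "Explain gravity to a 5-year-old" -> "Gravity is like a big hug..."
prompt = torch.tensor([1, 2, 3, 4])
response = torch.tensor([5, 6, 7])
ids = torch.cat([prompt, response])

# Next-token labels: mask the prompt region with -100 so the loss is
# computed only on the desired response, not on reproducing the prompt.
labels = ids.clone()
labels[: len(prompt)] = -100
inputs, targets = ids[:-1], labels[1:]

model = TinyLM()
opt = torch.optim.AdamW(model.parameters(), lr=1e-2)

logits = model(inputs.unsqueeze(0))          # (1, seq_len, VOCAB)
loss = nn.functional.cross_entropy(
    logits.view(-1, VOCAB), targets.view(-1), ignore_index=-100
)
loss.backward()
opt.step()
```

Repeating this update over many curated pairs shifts the model's output distribution toward instruction-following behavior; the underlying architecture and linguistic knowledge are unchanged, which is why the procedure is an alignment technique rather than additional pre-training.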