1Cademy - A development team is fine-tuning a language model to act as a programming assistant. Their initial training data consists of thousands of simple instruction-code pairs, such as a prompt asking for a function to add two numbers and the corresponding correct code. After training, they observe that the model performs well on simple, one-step tasks but consistently fails to generate correct code for complex problems that require breaking the problem down into multiple logical steps. Which of the fo

Learn Before

Improving SFT Efficiency with Advanced Data Construction

Multiple Choice

A development team is fine-tuning a language model to act as a programming assistant. Their initial training data consists of thousands of simple instruction-code pairs, such as a prompt asking for a function to add two numbers and the corresponding correct code. After training, they observe that the model performs well on simple, one-step tasks but consistently fails to generate correct code for complex problems that require breaking the problem down into multiple logical steps. Which of the fo

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related