Error Analysis in Fine-Tuning Data Preparation
A data scientist is preparing a dataset to fine-tune a model for translating German news headlines to Japanese. Analyze the following training sample they created. Identify the primary error in how the data is structured according to the concatenated input method, and briefly explain the correction.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A developer is preparing a dataset to fine-tune a language model for translating French legal text to English. The task instruction is 'Translate the following legal text from French to English.' and a specific French sentence to be translated is 'Le défendeur a plaidé non coupable.'. The correct English translation is 'The defendant pleaded not guilty.'. Which of the following options correctly formats this single data sample as a concatenated input (
x) and output (y) pair for training?Error Analysis in Fine-Tuning Data Preparation
Constructing a Fine-Tuning Sample for Translation