Example of a Concatenated Sample for Machine Translation Fine-Tuning
In machine translation, fine-tuning data can be formulated as a concatenated pair, functioning as a complete training sample for a Large Language Model. The input sequence, denoted mathematically as , is constructed by joining the task instruction and the specific source text. The target output, denoted as , provides the correct translation. For example, an English-to-Chinese translation sample is represented as:
- = "Translate the text from English to Chinese.\n How's the weather today?"
- = "今天天气怎么样?"
This structured pair is then used directly to adjust the model's parameters during fine-tuning.

0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Example of a Generated Fine-Tuning Sample for Machine Translation
Example of a Structured Fine-Tuning Sample for Machine Translation
A developer is preparing data to train a language model for a French-to-Spanish translation task. They are using the following prompt template and text pair:
Template: Translate the following text from French to Spanish.\n\nFrench: {text}\n\nSpanish: {translation}
Text Pair:
- Source Text (French): "Bonjour, comment ça va ?"
- Target Text (Spanish): "Hola, ¿cómo estás?"
Analyze the options below and select the one that represents a correctly generated fine-tuning sample based on the provided components.
You are preparing a dataset to fine-tune a language model for a machine translation task. Arrange the following actions in the correct chronological order to generate a single, complete fine-tuning sample.
Troubleshooting a Machine Translation Fine-Tuning Sample
Example of a Concatenated Sample for Machine Translation Fine-Tuning
Learn After
A developer is preparing a dataset to fine-tune a language model for translating French legal text to English. The task instruction is 'Translate the following legal text from French to English.' and a specific French sentence to be translated is 'Le défendeur a plaidé non coupable.'. The correct English translation is 'The defendant pleaded not guilty.'. Which of the following options correctly formats this single data sample as a concatenated input (
x) and output (y) pair for training?Error Analysis in Fine-Tuning Data Preparation
Constructing a Fine-Tuning Sample for Translation