1Cademy - Generating Fine-Tuning Samples for Machine Translation

Learn Before

Initial Step in Creating Machine Translation Fine-Tuning Data

Activity (Process)

Generating Fine-Tuning Samples for Machine Translation

The creation of fine-tuning data for machine translation involves collecting pairs of source and target texts. These text pairs are then used to populate variables, such as {∗text∗} for the source text and {∗translation∗} for the target text, within a predefined prompt template. This substitution process generates the final samples needed for fine-tuning the model.

Updated 2026-05-01

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Example of a Generated Fine-Tuning Sample for Machine Translation
Example of a Structured Fine-Tuning Sample for Machine Translation
A developer is preparing data to train a language model for a French-to-Spanish translation task. They are using the following prompt template and text pair:

Template: Translate the following text from French to Spanish.\n\nFrench: {text}\n\nSpanish: {translation}

Text Pair:
- Source Text (French): "Bonjour, comment ça va ?"
- Target Text (Spanish): "Hola, ¿cómo estás?"
Analyze the options below and select the one that represents a correctly generated fine-tuning sample based on t
You are preparing a dataset to fine-tune a language model for a machine translation task. Arrange the following actions in the correct chronological order to generate a single, complete fine-tuning sample.
Troubleshooting a Machine Translation Fine-Tuning Sample
Example of a Concatenated $(\mathbf{x}, \mathbf{y})$ Sample for Machine Translation Fine-Tuning

Learn Before

Related

Learn After