Initial Step in Creating Machine Translation Fine-Tuning Data
To illustrate the process of creating fine-tuning data, consider an English-to-Chinese machine translation task. The first step is not simply collecting parallel texts, but actively designing a prompt template. This template serves to clearly describe the task and define the problem's format for the model.
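As a rough illustration of what such a template might look like, the sketch below defines one possible English-to-Chinese prompt and fills it with a source sentence. The template wording, the function names `build_prompt` and `build_sample`, and the `prompt`/`completion` field names are all hypothetical choices made for this example, not a format prescribed by the source.

```python
# Hypothetical template: the instruction wording and the "English:"/"Chinese:"
# labels are illustrative assumptions, not a required format.
PROMPT_TEMPLATE = (
    "Translate the following text from English to Chinese.\n"
    "English: {source}\n"
    "Chinese:"
)


def build_prompt(source: str) -> str:
    """Fill the template with one English source sentence."""
    return PROMPT_TEMPLATE.format(source=source.strip())


def build_sample(source: str, target: str) -> dict:
    """Pair the filled prompt with its reference translation.

    The "prompt"/"completion" keys are an assumed record layout.
    """
    return {"prompt": build_prompt(source), "completion": " " + target.strip()}


if __name__ == "__main__":
    sample = build_sample("How's the weather today?", "今天天气怎么样？")
    print(sample["prompt"])
    print(sample["completion"])
```

Fixing the template before touching the parallel corpus means every sentence pair is later presented to the model in the same task description and format.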
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Computing Sciences
Foundations of Large Language Models Course
Related
Complexity of Data Annotation for LLMs vs. Conventional NLP
Limitations of Manual Data Generation for Fine-Tuning
Difficulty of Human Annotation for Complex Tasks
A small, unfunded research lab wants to fine-tune a language model for a highly specialized, novel task: generating legal summaries of court proceedings for a niche area of patent law. They have access to a few legal experts but have a very limited budget. If they choose to have their experts create the input-output training pairs from scratch, which statement best evaluates the primary trade-off they will face?
Diagnosing Model Performance Issues
Evaluating Data Generation Strategy for a General-Purpose LLM
Learn After
Generating Fine-Tuning Samples for Machine Translation
A team is beginning a project to fine-tune a language model for a new task: translating technical manuals from English to Japanese. They have already acquired a large collection of parallel English and Japanese technical documents. Which of the following actions should the team prioritize as their immediate first step to ensure the model learns the task effectively?
When preparing a dataset for a machine translation fine-tuning task, the most effective initial action is to gather a large volume of source and target text pairs before considering how the task will be presented to the model.
You are tasked with creating a fine-tuning dataset for a language model to perform English-to-Spanish translation. Arrange the following actions into the correct chronological order.
Examples of Prompt Templates for English-to-Chinese Translation