Example

Example of a Concatenated (x,y)(\mathbf{x}, \mathbf{y}) Sample for Machine Translation Fine-Tuning

In machine translation, fine-tuning data can be formulated as a concatenated (x,y)(\mathbf{x}, \mathbf{y}) pair, functioning as a complete training sample for a Large Language Model. The input sequence, denoted mathematically as x\mathbf{x}, is constructed by joining the task instruction and the specific source text. The target output, denoted as y\mathbf{y}, provides the correct translation. For example, an English-to-Chinese translation sample is represented as:

  • x\mathbf{x} = "Translate the text from English to Chinese.\n How's the weather today?"
  • y\mathbf{y} = "今天天气怎么样?"

This structured (x,y)(\mathbf{x}, \mathbf{y}) pair is then used directly to adjust the model's parameters during fine-tuning.

Image 0

0

1

Updated 2026-05-01

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences