Multiple Choice

A language model is being trained on a large dataset of text. After an initial training iteration, the model's performance is measured on three distinct sequences from the dataset, yielding the following loss values:

  • Sequence 1: Loss = 8.4
  • Sequence 2: Loss = 2.1
  • Sequence 3: Loss = 5.5

Based on the fundamental objective of this training process, which of the following statements most accurately describes the model's overall goal?

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science