1Cademy - A research team is training a new language model and records the following test error rates as they increase the size of the training dataset: - 1 billion tokens: Error 3.50 - 2 billion tokens: Error 3.45 - 4 billion tokens: Error 3.42 - 8 billion tokens: Error 3.10 - 16 billion tokens: Error 2.50 Based on this data, at what point does the model most clearly transition *out of* the initial, slow improvement stage of training?

Multiple Choice

A research team is training a new language model and records the following test error rates as they increase the size of the training dataset:

Based on this data, at what point does the model most clearly transition out of the initial, slow improvement stage of training?

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science