Learn Before
Case Study

Strategic Model Improvement

A research lab has a large language model and a fixed budget for the next improvement cycle. They are weighing two options:

  1. Use the budget to acquire and process a new dataset that is ten times larger than their current one.
  2. Use the budget to fund a team of engineers for several months to experiment with novel, unproven architectural changes to the model, using the existing dataset.

Based solely on the established principle that describes how a model's final test performance relates to the amount of training data, which of these two strategies represents a more predictable path to achieving a lower test loss? Justify your reasoning.

0

1

Updated 2025-10-05

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science