Case Study

Model Selection Based on Performance Scaling

A research lab is training two different language models, Model Alpha and Model Beta. Their performance, measured by loss (L), is modeled as a function of the computational resources (x) used for training. The relationships are given by the following equations:

  • Model Alpha: L(x) = 2.5 * x^-0.1
  • Model Beta: L(x) = 5.0 * x^-0.2

The lab has a fixed budget that allows them to use x = 10,000 units of computational resources. Based on these scaling laws, which model should the lab choose to achieve the lowest possible loss with their available resources? Justify your answer with calculations.

0

1

Updated 2025-10-08

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Application in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science