1Cademy - Limitations of Monotonic Scaling Functions

Learn Before

Absence of a Universal Scaling Law

Concept

Limitations of Monotonic Scaling Functions

The scaling laws commonly used to model LLM performance, such as power laws, are based on monotonic functions. A significant limitation of this approach is that these functions cannot capture more complex learning dynamics that include inflection points, such as the double descent phenomenon.

Updated 2026-04-22

Contributors are:

Who are from:

References

Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course
Reference of Foundations of Large Language Models Course

Learn After

Fitting LLM Learning Curves with Diverse Functions
A research team is modeling the performance of a large language model as they increase the amount of training data. Their predictive model, based on a standard power-law function, anticipates a steady, continuous improvement in performance. However, their experiments show that the model's error rate first decreases, then temporarily increases, before decreasing again. Which statement best analyzes the limitation of their predictive model in this context?
Evaluating a Predictive Model for LLM Training
Predicting Complex Learning Dynamics

Learn Before

Related

Learn After