Learn Before
Power-Law Curve of Performance Scaling
A scaling-law curve, which plots test error against a variable of interest such as training dataset size, can typically be divided into three phases. In the first phase, test error decreases slowly. In the second phase, test error drops rapidly, following a power law. In the third phase, the reduction slows again as the model approaches an irreducible error that cannot be eliminated regardless of the amount of training data.
Tags
Foundations of Large Language Models
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
A research team is training a large language model and has a fixed, non-negotiable computational budget. Their goal is to achieve the lowest possible final loss. Based on the established principles that govern the relationship between computation, model size, data size, and performance, which of the following strategies represents the most efficient use of their budget?
Evaluating an LLM Training Strategy
Analyzing Deviations from LLM Scaling Behavior
Continued Effectiveness of Scaling up Training in NLP
Power-Law Curve of Performance Scaling
Scaling Laws Across LLM Development Stages