A research team is training a 10-billion parameter language model. After consuming 25% of their total compute budget, they observe that the model's performance improvement, when plotted against the compute used, is tracking perfectly along the curve predicted by established scaling laws. However, this predicted trajectory indicates that the model will fall short of its target performance goal by the time 100% of the budget is used. Based on the predictive utility of scaling laws, what is the most logical and resource-efficient decision for the team to make?
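The team's situation, where an early loss curve is extrapolated to the full budget, can be sketched with a simple fit. This is a minimal illustration, assuming a pure power-law relation between loss and compute; all numbers (measurements, exponent, target loss) are hypothetical, not taken from the question.

```python
import numpy as np

# Hypothetical early-training measurements: loss observed at several
# fractions of the total compute budget (synthetic, illustrative values).
compute_frac = np.array([0.05, 0.10, 0.15, 0.20, 0.25])
loss = 3.2 * compute_frac ** -0.12  # data generated from an assumed power law

# Fit L(C) = a * C^b by linear regression in log-log space:
# log L = b * log C + log a.
b, log_a = np.polyfit(np.log(compute_frac), np.log(loss), 1)
a = np.exp(log_a)

# Extrapolate to 100% of the budget (C = 1.0) and compare with the goal.
predicted_final_loss = a * 1.0 ** b  # equals a, since C = 1.0
target_loss = 3.0  # hypothetical performance target

if predicted_final_loss > target_loss:
    print("Predicted shortfall: the remaining budget, spent on the current "
          "run, will not reach the target; reconsider the plan now.")
```

Because the fitted curve already shows the run missing its target, the resource-efficient move is to act on that prediction at the 25% mark rather than spend the remaining 75% confirming it.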
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Optimizing LLM Training with a Fixed Budget
Strategic Implications of Scaling Law Predictions in LLM Training