1Cademy - A research team is training a large language model and plots its test error against the training dataset size on a log-log scale. The resulting curve is divided into three distinct regions. Region A shows an initial, slow decrease in error. Region B shows a steep, consistent, and linear decrease in error. Region C shows the rate of error decrease slowing down significantly, approaching a plateau. In which region would increasing the training dataset size be the most effective and predictable strategy for improving the models performance?

Learn Before

Power-law Reduction Phase in LLM Scaling

Multiple Choice

A research team is training a large language model and plots its test error against the training dataset size on a log-log scale. The resulting curve is divided into three distinct regions. Region A shows an initial, slow decrease in error. Region B shows a steep, consistent, and linear decrease in error. Region C shows the rate of error decrease slowing down significantly, approaching a plateau. In which region would increasing the training dataset size be the most effective and predictable strategy for improving the model's performance?

Updated 2025-09-28

Contributors are:

Who are from:

Learn Before

Related