Concept

Fitting LLM Learning Curves with Diverse Functions

In response to the inability of simple monotonic functions to capture all aspects of LLM learning, researchers explore more sophisticated and diverse mathematical functions to model training curves. This approach, exemplified in studies by Alabdulmohsin et al. [2022] and Caballero et al. [2023], aims to find better fits for complex phenomena that standard scaling laws miss.

0

1

Updated 2026-05-02

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences