1Cademy - Improved Power Law Formula for LLM Loss

Improved Power Law Formula for LLM Loss

The improved scaling law adds an irreducible error term, $\epsilon_{\infty}$, to the basic power law, giving the equation $\mathcal{L}(x) = ax^b + \epsilon_{\infty}$. This is one of the most widely used functional forms for scaling laws in Large Language Models. Here $x$ is the variable of interest, $a$ and $b$ are the coefficients of the basic power law, and $\epsilon_{\infty}$ is the irreducible error resulting from unknown variables. For a loss that falls as $x$ grows, $b$ is negative, so $ax^b \to 0$ as $x \to \infty$ and the loss approaches $\epsilon_{\infty}$ rather than zero. In other words, this error persists even as the variable of interest approaches infinity.
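To make the role of $\epsilon_{\infty}$ concrete, here is a minimal Python sketch that evaluates $\mathcal{L}(x) = ax^b + \epsilon_{\infty}$ and recovers the three parameters from synthetic, noise-free data. Everything in it is an assumption for illustration: the function names, the "true" values of $a$, $b$ and $\epsilon_{\infty}$, and the fitting procedure itself (a grid search over $\epsilon_{\infty}$ combined with a log-log linear fit) are hypothetical and not taken from the source or from any particular paper.

```python
import math


def power_law_loss(x, a, b, eps_inf):
    """Evaluate L(x) = a * x**b + eps_inf."""
    return a * x ** b + eps_inf


def fit_log_linear(xs, ys):
    """Least-squares fit of log(y) = log(a) + b * log(x).

    Returns (a, b, sse), where sse is the squared error in log space.
    """
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((u - mean_x) ** 2 for u in lx)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(lx, ly))
    b = sxy / sxx
    log_a = mean_y - b * mean_x
    sse = sum((log_a + b * u - v) ** 2 for u, v in zip(lx, ly))
    return math.exp(log_a), b, sse


def fit_power_law_with_floor(xs, losses, steps=2000):
    """Grid-search eps_inf in [0, min(losses)) and fit a, b for each candidate.

    Subtracting the right eps_inf turns the data into a pure power law,
    which is a straight line in log-log space, so the best candidate is
    the one with the smallest log-space residual.
    """
    upper = min(losses)
    best = None
    for i in range(steps):
        eps = upper * i / steps  # strictly below min(losses), so logs are defined
        a, b, sse = fit_log_linear(xs, [y - eps for y in losses])
        if best is None or sse < best[3]:
            best = (a, b, eps, sse)
    return best[:3]


# Synthetic data from assumed (hypothetical) parameters.
TRUE_A, TRUE_B, TRUE_EPS = 10.0, -0.5, 1.7
xs = [10.0 ** k for k in range(2, 7)]
losses = [power_law_loss(x, TRUE_A, TRUE_B, TRUE_EPS) for x in xs]

a_hat, b_hat, eps_hat = fit_power_law_with_floor(xs, losses)
print(f"a = {a_hat:.3f}, b = {b_hat:.3f}, eps_inf = {eps_hat:.4f}")

# With b < 0 the power term vanishes for large x, leaving only eps_inf.
print(power_law_loss(1e30, TRUE_A, TRUE_B, TRUE_EPS))
```

The grid search is chosen only because it needs nothing beyond the standard library. On real, noisy measurements a nonlinear least-squares routine would likely be a better fit for the job, and the estimate of $\epsilon_{\infty}$ tends to be sensitive to how far the largest $x$ values reach.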