Learn Before
Two language models, Model A and Model B, have their performance (loss, L) modeled as a function of a resource x (where x > 1). The relationship for each is described by a power law equation:
- Model A:
L(x) = 0.5 * x^-0.1 - Model B:
L(x) = 0.5 * x^-0.2
Based on these equations, which statement correctly analyzes the models' improvement as more of the resource x is used?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Empirical Power Law for LLM Loss vs. Model Size (N)
Empirical Power Law for LLM Loss vs. Dataset Size (D)
Two language models, Model A and Model B, have their performance (loss, L) modeled as a function of a resource
x(wherex > 1). The relationship for each is described by a power law equation:- Model A:
L(x) = 0.5 * x^-0.1 - Model B:
L(x) = 0.5 * x^-0.2
Based on these equations, which statement correctly analyzes the models' improvement as more of the resource
xis used?- Model A:
Interpreting the Power Law Exponent
Model Selection Based on Performance Scaling