A research team observes the performance of a new model as they increase the amount of training data. They plot the test error and notice a distinct pattern: the error first decreases, then temporarily increases, before starting to decrease again. When attempting to create a mathematical formula to predict this error curve, which approach would be most appropriate?
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Power Law Formulation for LLM Loss
A research team observes the performance of a new model as they increase the amount of training data. They plot the test error and notice a distinct pattern: the error first decreases, then temporarily increases, before starting to decrease again. When attempting to create a mathematical formula to predict this error curve, which approach would be most appropriate?
Modeling an Unconventional Learning Curve
Evaluating Modeling Approaches for LLM Learning Curves