An AI research lab is developing a new large language model and has a fixed computational budget. According to the principles that formalize the relationship between a model's performance, its size, and the quantity of its training data, which of the following strategies is most likely to yield the best-performing model within their budget?
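One way to reason about the trade-off in this question is to sketch it numerically. The example below is a minimal illustration, not a fitted scaling law: it assumes a power-law loss of the form L(N, D) = E + A / N^alpha + B / D^beta and the common approximation that training compute is about 6 * N * D, where N is the parameter count and D is the number of training tokens. The function names and all constants are hypothetical placeholders chosen only for demonstration.

```python
# Illustrative sketch only. The loss form and every constant below are
# assumptions for demonstration, not values taken from the source.

def loss(n_params, n_tokens, e=1.7, a=400.0, alpha=0.34, b=410.0, beta=0.28):
    """Assumed power-law loss: irreducible term plus model-size and data terms."""
    return e + a / n_params**alpha + b / n_tokens**beta


def best_split(compute_budget, num_candidates=200):
    """Scan model sizes under a fixed budget and return (loss, n_params, n_tokens).

    Assumes training compute is roughly 6 * n_params * n_tokens, so choosing
    a model size fixes how many tokens the budget can pay for.
    """
    best = None
    for i in range(num_candidates):
        # Log-spaced candidate model sizes from 1e6 to 1e12 parameters.
        n_params = 10 ** (6 + 6 * i / (num_candidates - 1))
        n_tokens = compute_budget / (6 * n_params)
        candidate = (loss(n_params, n_tokens), n_params, n_tokens)
        if best is None or candidate < best:
            best = candidate
    return best


if __name__ == "__main__":
    budget = 1e21  # hypothetical compute budget in FLOPs
    best_loss, n_params, n_tokens = best_split(budget)
    print(f"params={n_params:.3g} tokens={n_tokens:.3g} loss={best_loss:.3f}")
```

Under these assumed constants, the lowest loss falls at an intermediate model size rather than at either extreme of the scan, which is the kind of balance between model size and training data that the question asks about. Different constants would move the optimum, so the sketch shows the shape of the trade-off rather than a specific recommendation.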
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.3 Prompting - Foundations of Large Language Models
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Modeling LLM Performance with Scaling Functions
Guiding Role of Scaling Laws in LLM Research
Predictive Utility of Scaling Laws for LLM Training Decisions
Evolving Understanding of Scaling Laws
Insufficiency of Model Size Scaling for AGI
Evaluating Competing LLM Training Strategies
The Strategic Importance of Predictable Performance Scaling