Learn Before
Scaling LLMs Beyond Size
A team of AI developers has a fixed computational budget and cannot increase the size of their language model or the volume of its training data. Describe an alternative approach they could take to enhance the model's utility, and provide a specific example of a new capability this approach could unlock.
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Ch.3 Prompting - Foundations of Large Language Models
Application in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Architectural Adaptation of LLMs for Long Sequences
Types of LLM Scaling
Multifaceted Nature of LLM Scaling
Inference-Time Compute Scaling for Improved Reasoning
A research lab has a powerful language model that is highly effective at generating short, creative story paragraphs. The lab now wants to use this model to write entire multi-chapter novels, which requires maintaining plot consistency and character arcs over tens of thousands of words. Which of the following development priorities best represents a shift in scaling dimension to meet this new requirement?
Evaluating a Model Scaling Strategy
Scaling LLMs Beyond Size