Computational Trade-offs in Model Development
A technology company wants to create a specialized language model for analyzing medical research papers. They can either (A) train a new, large model from the beginning using a massive dataset of medical texts, or (B) adapt an existing, powerful general-purpose model by continuing its training on the same medical dataset. Analyze the primary difference between these two approaches in terms of the computational resources required.
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A small research lab with a limited budget and computational resources wants to develop a model that can summarize scientific papers. They have access to a large, general-purpose language model that has already been trained on a massive corpus of internet text. Given their resource constraints, which strategy is the most computationally efficient for them to pursue?
Computational Trade-offs in Model Development
Evaluating a Model Development Strategy