Improving a Creative Writing LLM
Based on the case study below, analyze the most probable reason for the language model's specific performance limitations and propose a data-centric strategy to address them.
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Impact of Combined Datasets on LLM Performance
A development team is creating a new large language model intended to be a general-purpose, public-facing chatbot. They decide to pre-train it exclusively on a massive corpus consisting of peer-reviewed scientific papers and academic journals. Which of the following statements best evaluates the most likely outcome of this training strategy?
A large language model's pre-training corpus is carefully constructed by combining data from various sources to instill different capabilities. Match each data source with the primary capability it helps the model develop.