Model Suitability for Novel Tasks
A research team wants to build a system that can answer questions about newly published scientific papers across various disciplines, a task for which no pre-existing, labeled dataset exists. Explain why a large, generative language model is inherently better suited for this 'zero-shot' scenario compared to a model like BERT, focusing on their fundamental differences in handling new tasks.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A development team has access to two pre-trained language models. Model X is a large, generative model known for its ability to follow general instructions. Model Y is an encoder-based model that achieves state-of-the-art performance on a task after it has been fine-tuned with thousands of task-specific examples. The team's immediate goal is to create a prototype that can summarize legal documents, a task for which they have no training data. Which of the following statements most accurately analyzes the situation?
Model Selection for a Resource-Constrained Startup
Model Suitability for Novel Tasks