Learn Before
A large language model's ability to perform tasks it has never been specifically trained on is primarily achieved by adding a specialized 'zero-shot capability module' after its initial pre-training is complete.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A team of engineers trains a new large-scale language model on a massive and diverse dataset of text from the internet. After training, they are surprised to find that the model can accurately translate sentences from English to French, even though it was never explicitly given English-to-French translation examples. Which statement best analyzes the origin of this unexpected capability?
Evaluating Claims about LLM Capabilities
A large language model's ability to perform tasks it has never been specifically trained on is primarily achieved by adding a specialized 'zero-shot capability module' after its initial pre-training is complete.