A research team aims to adapt a powerful, existing language model to summarize entire books, a task requiring the model to process very long sequences of text. They have access to a vast, diverse dataset of general web text and a smaller, curated dataset composed exclusively of full-length books. To achieve their goal efficiently, what is the most effective two-stage approach for the team to follow?
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Role of Specific Positional Embeddings in Long-Context Pre-training
Evaluating a Model Adaptation Strategy
A research team aims to adapt a powerful, existing language model to summarize entire books, a task requiring the model to process very long sequences of text. They have access to a vast, diverse dataset of general web text and a smaller, curated dataset composed exclusively of full-length books. To achieve their goal efficiently, what is the most effective two-stage approach for the team to follow?
A machine learning engineer is adapting a pre-existing language model to effectively handle long documents. The process involves two distinct stages. Arrange the following stages in the correct chronological order.