Learn Before
A development team first trains a large neural network to classify thousands of user reviews as either 'positive' or 'negative'. After this initial training is complete, they remove the final classification layer and use the rest of the trained network as a starting point to build a new system for automatically generating marketing slogans. What is the most significant advantage of this two-step approach?
0
1
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A development team first trains a large neural network to classify thousands of user reviews as either 'positive' or 'negative'. After this initial training is complete, they remove the final classification layer and use the rest of the trained network as a starting point to build a new system for automatically generating marketing slogans. What is the most significant advantage of this two-step approach?
A machine learning engineer wants to use a large dataset of customer support emails, each labeled as 'urgent' or 'not urgent', to pre-train a model. The ultimate goal is to create a new system that can automatically categorize incoming emails into more specific topics (e.g., 'billing issue', 'technical problem', 'feedback'). Arrange the following steps in the correct logical order to accomplish this.
An e-commerce company is building a chatbot to answer customer questions about product specifications. They have a small, high-quality dataset of 5,000 question-answer pairs specific to their products, but find the chatbot struggles to understand the nuances of customer phrasing. The company also has access to a massive dataset of 2 million online forum posts, where each post has been labeled by moderators as either 'question' or 'statement'. Which of the following strategies describes the most appropriate way to use the forum post dataset to improve the final chatbot?