Learn Before
Using a Pre-trained Model for Transfer Learning in Deep Learning
- Select a pre-trained model that is already well-trained to classify/predict similar outputs from the same inputs.
- Replace the old last few layer(s) with one or more new last layer(s).
- Fine-tune: retrain and tune the model on the new dataset for the new task.
0
2
Contributors are:
Who are from:
Tags
Data Science
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Benefits of Pre-training
Using a Pre-trained Model for Transfer Learning in Deep Learning
Model Training Strategy for Medical Imaging
A research team is developing a model to classify rare plant diseases from a small, specialized dataset of only 500 leaf images. They are considering several training strategies. Which of the following strategies best demonstrates an understanding of how to leverage an initial, general-purpose training phase to overcome the limitation of a small dataset?
Rationale for Pre-training with General Data
Learn After
POPULAR PRE-TRAINED MODELS
Number of Layers to Freeze When Using a Pre-trained Model for Transfer Deep Learning
Implementing Freezing Layers When Using a Pre-trained Model for Transfer Deep Learning
An engineer needs to build a model to classify 15 types of local wildflowers using a custom dataset of only 900 images. They select a very deep and complex neural network that was previously trained on a dataset of over a million general-purpose images (e.g., animals, vehicles, household objects). The engineer's strategy is to retrain all layers of this complex network from scratch, using only their small wildflower dataset. What is the most likely outcome of this strategy?
You are tasked with building an image classifier for a new, specialized task (e.g., identifying specific types of industrial equipment), but you only have a small, custom dataset. You decide to adapt a model that has already been trained on a very large, general image dataset. Arrange the following steps in the correct logical order to implement this strategy.
Adapting a Pre-trained Network for a New Task