Rationale for Modifying a Pre-trained Model
A language model has been pre-trained on a large dataset to predict the next word in a sentence. You now want to adapt this model for a new task: classifying news articles into categories like 'Sports', 'Technology', and 'Politics'. Explain why the final layer of the pre-trained model must be replaced before you can begin training on the new classification task.
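The core of the answer is dimensional: the pre-trained head maps hidden states to vocabulary-sized logits, while classification needs one logit per category. Below is a minimal NumPy sketch of that mismatch, using toy randomly initialized weights as stand-ins for pre-trained parameters (all names and sizes here are illustrative assumptions, not any particular model's API):

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, hidden, num_classes = 1000, 64, 3

# Stand-in for the pre-trained encoder: in practice these weights come
# from pre-training; here they are random for illustration only.
W_embed = rng.normal(size=(vocab_size, hidden))

def encode(token_ids):
    # Toy sentence representation: mean-pooled token embeddings.
    return W_embed[token_ids].mean(axis=0)

# Pre-training head: hidden -> vocab-sized logits for next-word prediction.
W_lm_head = rng.normal(size=(hidden, vocab_size))

# The pre-trained head scores words, not labels, so it cannot produce
# class scores. It is replaced by a new, randomly initialized head whose
# output dimension equals the number of categories.
W_cls_head = rng.normal(size=(hidden, num_classes))

h = encode(np.array([5, 42, 901]))
lm_logits = h @ W_lm_head    # shape (1000,): a score per vocabulary word
cls_logits = h @ W_cls_head  # shape (3,): a score per news category

print(lm_logits.shape, cls_logits.shape)  # (1000,) (3,)
```

The encoder's weights are kept because they carry the general language knowledge learned during pre-training; only the task-specific head is swapped out and trained from scratch on the labeled classification data.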
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Comprehension in Revised Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Troubleshooting a Model Adaptation Pipeline
A machine learning engineer has successfully pre-trained a large language model on a massive text corpus with the objective of predicting the next word in a sequence. To adapt this model for a new task of classifying customer reviews as 'positive', 'negative', or 'neutral', the engineer's first step is to remove the model's final output layer. What is the most accurate justification for this action?