Concept

Inadequacy of Pre-trained Model Parameters for Downstream Tasks

Because the initial parameters of a pre-trained model—specifically the classifier parameters ω\omega and the encoder parameters θ^\hat{\theta}—are not originally optimized for a specific downstream classification task, the model cannot be applied directly. Instead, a modified version of the model must be created and adapted to achieve accurate results on the new task.

0

1

Updated 2026-04-14

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences