Mismatch Between Pre-training and Downstream Objectives
A research lab develops a large-scale language model by training it on a massive corpus of historical literature so that it becomes expert at completing sentences in the style of 18th-century authors. A separate team wants to use this model, without any modifications, to power a modern-day customer service chatbot. Explain the fundamental reason why this approach is likely to fail, focusing on what the model's learned parameters were actually optimized to do.
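The mismatch can be made concrete with a deliberately tiny stand-in for pre-training: a bigram next-word model fitted on a period-style snippet. The corpus, the `complete` function, and the prompts below are all illustrative assumptions, not part of the question; the point is only that a model trained purely to continue text can do nothing but continue text in the style of its training data.

```python
from collections import defaultdict

# Toy stand-in for pre-training: fit a bigram next-word model on a
# (hypothetical) 18th-century-style corpus. Purely illustrative data.
corpus = (
    "it is a truth universally acknowledged that a gentleman "
    "in possession of a good fortune must be in want of a wife"
).split()

bigrams = defaultdict(list)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev].append(nxt)

def complete(prompt_word, length=5):
    """Greedy next-word completion: the ONLY behavior the model learned."""
    out = [prompt_word]
    for _ in range(length):
        candidates = bigrams.get(out[-1])
        if not candidates:
            break  # token never seen during pre-training: nothing to emit
        out.append(candidates[0])
    return " ".join(out)

# A customer-service query is out of distribution. The model either
# continues in corpus style or falls silent; it never answers a complaint.
print(complete("a"))       # -> "a truth universally acknowledged that a"
print(complete("refund"))  # -> "refund" (unseen token, no continuation)
```

The same failure mode scales up: a real pre-trained LM's parameters encode a distribution over continuations of its training text, not a mapping from customer queries to helpful responses, so using it unmodified just yields stylistically faithful continuations.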
Tags
Ch.1 Pre-training - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Adaptation of Pre-trained Models via Full Fine-Tuning
Freezing Encoder Parameters During Fine-Tuning
Evaluating the Direct Application of a General Language Model
A team develops a large language model by training it on a vast collection of text from the internet, with the sole objective of making it proficient at predicting the next word in a sequence. They then attempt to use this model directly, without any changes, to categorize customer support emails into 'Billing Issue', 'Technical Problem', or 'Feature Request'. The model performs poorly. Which of the following statements best explains this outcome?