Short Answer

Balancing Generalization and Specialization

A team is developing a large language model. During the initial training phase on a massive, diverse dataset, they focus on an objective that encourages the model to predict the next word in a sentence. Later, for a specific customer, they further train the model on a small, curated dataset of legal documents to improve its ability to answer legal questions. Explain how these two distinct training stages address the two fundamental challenges of the pre-training paradigm.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science