Short Answer

Evaluating a Fine-Tuning Strategy for a Specialized LLM

A team is fine-tuning a language model to create a specialized assistant for the legal domain. The senior developer argues for using their entire data budget on a single, massive dataset of one task: summarizing legal documents. They believe this will maximize performance. A junior developer suggests diversifying the fine-tuning data to include several other legal tasks (e.g., contract analysis, question answering), even if it means using fewer examples for the summarization task.

Critique the senior developer's strategy. Is it the optimal approach for creating a robust and versatile legal assistant? Explain your reasoning.

0

1

Updated 2025-10-07

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science