Case Study

Optimizing a Text Classification Pipeline

A team is building a text classifier for customer support tickets using a large, pre-trained language model. The model generates a vector representation for each ticket, which is then passed to a simple linear classifier for the final prediction. The system's accuracy is lower than desired.

Engineer A proposes replacing the simple linear classifier with a more complex, multi-layer neural network. Engineer B proposes focusing on fine-tuning the large, pre-trained language model itself on the ticket data.

Evaluate these two proposals. Which approach is generally considered the more impactful first step for improving performance in such a system, and why? Your explanation should address the distinct roles of the main model and the final prediction layer.

0

1

Updated 2025-09-26

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science