Learn Before
Case Study

Choosing the Right NLP Approach for a Specialized Task

A medical diagnostics company needs to develop a system that automatically classifies short patient reports into one of five predefined, highly specific disease categories. The system's accuracy must be extremely high and its behavior must be very predictable, as it will be used in a critical clinical workflow. The company has a large, high-quality dataset of reports, each expertly labeled with the correct category.

Two proposals are being considered:

  1. Proposal A: Use a state-of-the-art, massive language model that has been pre-trained on a vast corpus of general internet text. This model, which learned by predicting the next word in a sentence, has demonstrated impressive general knowledge and reasoning abilities. It would be adapted for the classification task.

  2. Proposal B: Design and train a new, smaller neural network from scratch, using only the company's own labeled dataset of patient reports. This model's architecture would be specifically tailored for this single classification task.

Evaluate the two proposals. Which proposal is more suitable for this specific application, and why? Justify your decision by contrasting the core strengths and potential weaknesses of each approach in the context of this high-stakes, narrow-domain problem.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Data Science

Deep Learning (in Machine learning)

Collective Intelligence

Psychology

Social Science

Empirical Science

Science

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Foundations of Large Language Models

Ch.2 Generative Models - Foundations of Large Language Models

Ch.3 Prompting - Foundations of Large Language Models

Evaluation in Bloom's Taxonomy

Cognitive Psychology

Related