Essay

Comparing Model Architectures for Text Extraction Tasks

Imagine you are tasked with building two different systems. System A must identify a continuous phrase (e.g., a person's name, a date) within a sentence that answers a specific question. System B must classify every single word in a sentence into predefined categories (e.g., Person, Location, Organization, or Other). Both systems will use the same foundational language model that produces a contextualized vector for each input word.

Compare and contrast the design of the final prediction layers you would build on top of the foundational model for System A versus System B. Explain why the different task requirements necessitate these different architectural choices.

0

1

Updated 2025-10-06

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science