Diagnosing a Language Model's Output
A user provides a language model with the prompt and article below. Analyze the model's response and identify the most fundamental capability the model is lacking that leads to this incorrect output. Explain your reasoning.
0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A developer is building a tool that allows users to generate custom email responses by providing specific commands, such as 'Write a polite refusal to this meeting invitation.' They have two language models to choose from:
- Model X: A very large model trained on a vast library of books and websites, excelling at generating fluent, human-like text.
- Model Y: A smaller model specifically trained on a dataset of commands paired with their correct outputs.
Which model is the more suitable choice for this tool, and why?
Enabling Instruction Following via Pre-training
Diagnosing a Language Model's Output
Analyzing Model Failure