Diagnosing AI Performance in a Legal Context
Based on the scenario provided, diagnose the most probable underlying issue with the AI's method of processing the document and explain your reasoning.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
An AI model is evaluated on its ability to understand a long, complex historical document. When asked, 'What year is mentioned in the third sentence of the 27th paragraph?', the model answers correctly. However, when asked, 'Based on the author's arguments in the first and final chapters, what is the author's primary critique of the events described?', the model provides a vague summary of the entire document without identifying the specific critique. Which of the following is the most likely explanation for this discrepancy in performance?
Diagnosing AI Performance in a Legal Context
Designing a Robust LLM Evaluation