A researcher is designing a test to specifically evaluate a Large Language Model's commonsense reasoning capabilities, which rely on implicit, real-world knowledge not explicitly stated in the prompt. Which of the following prompts would be the most effective for this specific purpose?
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Analysis of a Commonsense Reasoning Failure
Evaluating an LLM's Commonsense Reasoning Failure