Evaluating Prompt Designs for Nested Structures
Two researchers are designing prompts to test a large language model's ability to correctly complete nested bracket sequences. Analyze the two prompts below and determine which one provides a more robust test of the model's capabilities. Justify your reasoning.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A language model is given the following input sequence of opening brackets:
( { [. The model is tasked with generating the correct sequence of closing brackets to form a syntactically valid structure. Which of the following outputs represents the correct completion?Analyzing a Language Model's Failure on Nested Structures
Initiating Step-by-Step Reasoning for Problem Solving
Evaluating Prompt Designs for Nested Structures