Evaluating an AI's Problem-Solving Output
An AI assistant is given the following problem: "A company's profit increased from $50,000 in Year 1 to $65,000 in Year 2. What is the percentage increase in profit?"
The AI provides this response: "Calculation: ((65000 - 50000) / 50000) * 100 = 30. Final Answer: 30%"
Although the final answer is numerically correct, evaluate the quality of the AI's output. Identify the key element missing from the final step of a structured reasoning process, and explain why including it is critical for a high-quality answer.
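The arithmetic in the AI's response can be reproduced with a short sketch. The helper name `percentage_increase` and the variable names are hypothetical, introduced only for illustration; the sketch checks the number and does not attempt the evaluation the question asks for.

```python
def percentage_increase(old: float, new: float) -> float:
    """Return the percentage change from `old` to `new`."""
    if old == 0:
        # The ratio is undefined when the starting value is zero.
        raise ValueError("starting value must be non-zero")
    return (new - old) / old * 100


# Figures taken from the problem statement.
year_1_profit = 50_000
year_2_profit = 65_000

increase = percentage_increase(year_1_profit, year_2_profit)
print(f"{increase:.0f}%")  # prints 30%
```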
Related
An AI assistant is tasked with solving the following problem: "A rectangular garden is 15 meters long and 8 meters wide. A fence is to be built around its perimeter. If the fencing material costs $10 per meter, what is the total cost of the fence?"
The AI performs the following calculation:
- Perimeter = 2 * (15 + 8) = 46 meters
- Total Cost = 46 * 10 = $460
It then immediately provides the final output: "$460".
Even assuming the calculation is arithmetically correct, which statement best identifies a critical weakness in the AI's reasoning process?
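The two calculation steps in this problem can be checked with a minimal sketch. The function name `fence_cost` and its parameter names are assumptions made for illustration; the sketch confirms the arithmetic only and leaves the question about the reasoning process open.

```python
def fence_cost(length_m: float, width_m: float, price_per_m: float) -> float:
    """Return the cost of fencing the perimeter of a rectangle."""
    # Step 1: perimeter of a rectangle is twice the sum of its sides.
    perimeter_m = 2 * (length_m + width_m)
    # Step 2: total cost is the perimeter times the price per meter.
    return perimeter_m * price_per_m


# Figures taken from the problem statement: 15 m by 8 m, $10 per meter.
print(fence_cost(15, 8, 10))  # prints 460
```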
A problem-solving agent is designed to approach mathematical questions methodically. Arrange the following actions into the most logical and effective sequence from first to last.