Learn Before
Critique of an AI Training Methodology
A development team is training an AI assistant to solve complex, multi-step word problems in mathematics. Their current training method works as follows: the AI generates a complete solution, and if the final numerical answer is correct, the entire solution is marked as 'good.' If the final answer is incorrect, the entire solution is marked as 'bad.' Despite extensive training, the AI struggles to generalize its problem-solving skills to new types of problems.
0
1
Tags
Ch.5 Inference - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Critique of an AI Training Methodology
A team is training a robot to assemble a complex piece of furniture from a kit. The assembly requires 20 distinct steps. Which of the following training approaches would be most effective for teaching the robot to perform this task reliably and to correct its own mistakes in the future?
Explaining Feedback Types with an Analogy