Diagnosing Training Flaws in a Math AI
Based on the following scenario, explain the fundamental limitation of the training approach and why it results in an unreliable model.
Tags: Ch.5 Inference - Foundations of Large Language Models; Foundations of Large Language Models Course; Computing Sciences; Analysis in Bloom's Taxonomy; Cognitive Psychology; Psychology; Social Science; Empirical Science; Science
Related
Learning Analogy: Outcome vs. Process Feedback
A research team is training a language model to act as a programming assistant that writes complex, multi-step code functions. The training method rewards the model only if the final generated code executes without errors and produces the correct output. Despite extensive training, the model frequently generates code that is logically flawed, even if it sometimes produces the correct final result for the training examples. Which of the following statements best analyzes the fundamental weakness of this training approach?
Critique of AI Training Methodologies for Complex Tasks