AI Tutor Data Generation Strategy
A non-profit organization aims to create a reliable AI tutor for high school biology. To generate training data, they plan to collect questions from high school students and then use a powerful, general-purpose AI model to write the answers. These question-and-answer pairs will then be used to train the tutor. Based on this plan, identify one major advantage and one significant disadvantage of this approach.
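The data-generation plan described above can be sketched roughly as follows. This is a minimal illustration only, not the organization's actual pipeline: `build_pairs` and `stub_teacher` are hypothetical names, the sample questions are made up, and the stub stands in for a call to a real general-purpose model.

```python
import json
from typing import Callable, Dict, List


def build_pairs(questions: List[str], teacher: Callable[[str], str]) -> List[Dict[str, str]]:
    """Pair each collected student question with a model-written answer."""
    pairs = []
    for q in questions:
        q = q.strip()
        if not q:
            continue  # skip blank submissions
        # The answer is taken from the teacher model as-is; nothing here checks it.
        pairs.append({"question": q, "answer": teacher(q)})
    return pairs


def stub_teacher(question: str) -> str:
    # Hypothetical stand-in for a general-purpose model (assumption, not a real API).
    return f"[model-written answer to: {question}]"


# Made-up example questions, for illustration only.
questions = ["What is osmosis?", "  ", "Why do cells need mitochondria?"]
pairs = build_pairs(questions, stub_teacher)

# One JSON object per line, a common layout for fine-tuning data.
jsonl = "\n".join(json.dumps(p) for p in pairs)
print(jsonl)
```

Note that in this sketch the questions come from real students while every answer comes straight from the model without review, which is the property the question asks you to weigh.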
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
A company is building a specialized chatbot to provide users with reliable legal information. To create the training data, the team first gathers a large set of legal questions from the general public via an online platform. Next, they use a highly advanced, general-purpose language model to generate answers to all of these questions. These question-answer pairs are then used to fine-tune their new chatbot. Which of the following describes the most significant risk inherent in this specific data creation method?
Diagnosing a Flawed Fine-Tuning Dataset