Evaluating a Data Sourcing Strategy for a Specialized Chatbot
A startup aims to create a chatbot that provides expert-level advice on home gardening. To train the model, their lead engineer suggests fine-tuning a base model using a dataset created by scraping all posts and comments from a large, public online gardening forum. As the project lead, evaluate this strategy. Identify one major advantage and one significant potential risk of using this data source for the stated goal.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Sourcing Fine-Tuning Data from Q&A Websites
Evaluating a Data Sourcing Strategy for a Specialized Chatbot
A small startup with limited resources is fine-tuning a large language model to create a general-purpose, open-domain question-answering chatbot. Considering their constraints, which statement best analyzes the primary advantage of sourcing fine-tuning data from naturally occurring question-and-answer pairs on public websites?
Evaluating a Data Sourcing Strategy for a Specialized AI