Learn Before
Data Collection Strategy for an AI Coding Assistant
Evaluate the two proposed strategies. Which strategy is a more effective method for creating a high-quality dataset from a user base, and why? Justify your answer by explaining the primary advantage your chosen strategy offers over the other.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Workflow for Crowdsourcing Fine-Tuning Data
Advantages of Crowdsourcing Fine-Tuning Data
A company aims to improve its chatbot's ability to answer questions about its products. The proposed plan is to scrape their public user forum, collecting user-posted questions and pairing them with the corresponding community-provided answers that have the most 'upvotes'. What is the most critical flaw in this strategy for creating a high-quality dataset?
Data Collection Strategy for an AI Coding Assistant
A development team is building a dataset to fine-tune a language model for a new, specialized domain. They plan to use a crowdsourcing approach. Arrange the following steps into the most logical and effective workflow for this process.