1Cademy - A development team is fine-tuning a language model for a specialized medical question-answering task where accuracy is critical. They have two potential datasets: Dataset A consists of 100,000 unfiltered Q&A pairs scraped from various online health forums. Dataset B consists of 5,000 Q&A pairs carefully curated and verified for accuracy by medical experts. Which statement best evaluates the most effective approach for the team?

Learn Before

Impact of Data Quality on Fine-Tuning Sample Size

Multiple Choice

A development team is fine-tuning a language model for a specialized medical question-answering task where accuracy is critical. They have two potential datasets: Dataset A consists of 100,000 unfiltered Q&A pairs scraped from various online health forums. Dataset B consists of 5,000 Q&A pairs carefully curated and verified for accuracy by medical experts. Which statement best evaluates the most effective approach for the team?

Updated 2025-10-03

Contributors are:

Who are from:

Learn Before

Related