Multiple Choice

A research team is creating a new dataset to improve a large language model's capabilities. They are considering two different approaches:

Approach 1: Compile over 100 existing academic natural language processing tasks (e.g., text summarization, sentiment analysis, grammar correction) and convert them all into a standardized instruction-response format, resulting in over one million training examples.

Approach 2: Collect 50,000 complex, real-world questions submitted by users to a technical support forum. Then, use a powerful existing model to generate initial answers, which are subsequently reviewed, corrected, and significantly improved by human experts to serve as high-quality demonstrations.

Which approach better represents the modern focus of creating instruction fine-tuning datasets, and why?

Updated 2025-09-26

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science