Evaluating Fine-Tuning Data Generation Strategies
A startup is developing a highly versatile creative writing assistant. They need to create a large fine-tuning dataset and are considering two different strategies. Evaluate which strategy is more likely to produce a robust and versatile model that can handle a wide variety of user requests, and justify your reasoning.
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Evaluation in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Self-Instruct for Generating Fine-Tuning Data
A development team is fine-tuning a language model to function as a specialized customer support chatbot. They have collected a large dataset of high-quality, expert-written answers to common customer issues. To create the training pairs, the team manually wrote simple, direct questions corresponding to each answer. After deployment, they observe that the model performs well on straightforward queries but fails to provide correct answers when users phrase their questions in unconventional, complex, or indirect ways. Which of the following strategies represents the most effective next step to address this specific performance issue?
Evaluating Fine-Tuning Data Generation Strategies
Analyzing Data Generation Strategies for Model Robustness