Learn Before
A research team trains a large, powerful model by fine-tuning it on a dataset labeled by a smaller, less accurate model. After this training process, they evaluate the powerful model on a held-out test set and find its performance is 85%. This 85% figure represents the weak-to-strong performance (Pweak→strong). What is the most accurate interpretation of this result?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Performance Gap Recovered (PGR)
A research team trains a large, powerful model by fine-tuning it on a dataset labeled by a smaller, less accurate model. After this training process, they evaluate the powerful model on a held-out test set and find its performance is 85%. This 85% figure represents the weak-to-strong performance (Pweak→strong). What is the most accurate interpretation of this result?
Measuring Weak-to-Strong Generalization
To measure the weak-to-strong performance (Pweak→strong) of a powerful model, a specific sequence of actions must be followed. Arrange the core steps below into the correct chronological order.