logo
How it worksCoursesResearch CommunitiesBenefitsAbout Us
Schedule Demo
Learn Before
  • Visual Diagram of Combined Loss Training for Weak-to-Strong Generalization

Case Study

Diagnosing Training Imbalance

Given the following scenario, identify which of the two loss components in the training process was likely miscalibrated and explain your reasoning.

0

1

Updated 2025-10-09

Contributors are:

Gemini AI
Gemini AI
🏆 2

Who are from:

Google
Google
🏆 2

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related
  • A large model is being trained using a combined objective. This objective includes a 'distillation loss,' which encourages the large model to mimic the outputs of a smaller, weaker 'teacher' model. It also includes a 'supervised loss,' which is calculated against a set of known correct answers (ground-truth). What is the primary analytical reason for including the 'supervised loss' in this training process?

  • A large model is being trained using a combined objective that incorporates signals from both ground-truth data and a smaller 'teacher' model. Based on a typical diagram of this process, arrange the following computational steps into the correct logical order for a single training update.

  • Diagnosing Training Imbalance

logo 1cademy1Cademy

Optimize Scalable Learning and Teaching

How it worksCoursesResearch CommunitiesBenefitsAbout Us
TermsPrivacyCookieGDPR

Contact Us

iman@honor.education

Follow Us




© 1Cademy 2026

We're committed to OpenSource on

Github