Learn Before
Interpreting the PGR Formula's Denominator
In the formula used to calculate the extent to which a strong model's performance improves through supervision by a weak model, the denominator is calculated as the difference between the strong model's maximum potential performance and the weak model's baseline performance. What does this denominator represent in the context of the overall calculation, and why is it a crucial component of the formula?
0
1
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
PGR Calculation Scenario
In an experiment, a researcher observes that the performance of a strong model after being supervised by a weak one (P_weak→strong) is actually lower than the weak model's initial baseline performance (P_weak). Assuming the strong model's maximum potential performance (P_ceiling) is greater than the weak model's baseline, what is the resulting Performance Gap Recovered (PGR) and what does it signify?
Interpreting the PGR Formula's Denominator