Weak Performance (Pweak) as a Baseline Metric
Weak Performance (Pweak) is the performance of the weak model on a given test set. In weak-to-strong fine-tuning it serves as the baseline against which the strong model's performance is compared when measuring generalization improvement.
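Pweak anchors the Performance Gap Recovered (PGR) metric mentioned under Related: PGR = (Pweak→strong − Pweak) / (Pceiling − Pweak), so PGR is 0 when the supervised strong model merely matches the weak baseline and 1 when it reaches the strong ceiling. A minimal sketch (the numeric values are illustrative, not from the source):

```python
def performance_gap_recovered(p_weak, p_w2s, p_ceiling):
    """PGR = (Pweak->strong - Pweak) / (Pceiling - Pweak).

    p_weak is the weak-performance baseline; PGR is undefined when the
    strong ceiling does not exceed the baseline.
    """
    if p_ceiling == p_weak:
        raise ValueError("ceiling equals weak baseline; PGR is undefined")
    return (p_w2s - p_weak) / (p_ceiling - p_weak)

# Strong model only matches the weak baseline: no gap recovered.
print(performance_gap_recovered(0.72, 0.72, 0.90))  # 0.0
# Strong model reaches its ceiling: the full gap is recovered.
print(performance_gap_recovered(0.72, 0.90, 0.90))  # 1.0
```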
Tags
Ch.4 Alignment - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Example of Successful Weak-to-Strong Generalization: GPT-4 with GPT-2 Supervision
Weak Performance (Pweak) as a Baseline Metric
Weak-to-Strong Performance (Pweak→strong)
Strong Ceiling Performance (Pceiling)
Performance Gap Recovered (PGR)
Data Selection and Filtering Using Weak Models
Cascading Inference
Weak-to-Strong Generalization via Fine-Tuning on Weak Model Data
AI System Optimization Strategy
An AI development team is building a system to answer a very high volume of customer support queries. They implement a two-step process: first, a small, fast model attempts to answer each query. If this model's confidence in its answer is low, the query is then passed to a much larger, more powerful, but slower model. What is the most significant strategic advantage of this architectural choice?
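The two-step routing described in the question can be sketched as a confidence-thresholded cascade. All names and numbers below are hypothetical stand-ins, not an API from the source:

```python
def cascade_answer(query, small_model, large_model, threshold=0.8):
    """Try the small, fast model first; escalate to the large, slow
    model only when the small model's confidence falls below threshold."""
    answer, confidence = small_model(query)
    if confidence >= threshold:
        return answer          # cheap path handles the bulk of traffic
    return large_model(query)  # expensive fallback for hard queries

# Toy stand-ins: the small model is only confident on short queries.
small = lambda q: ("small-answer", 0.9 if len(q) < 20 else 0.3)
large = lambda q: "large-answer"

print(cascade_answer("reset password", small, large))                  # small-answer
print(cascade_answer("a long, unusual multi-part query", small, large))  # large-answer
```

The strategic advantage this illustrates: average cost and latency stay close to the small model's, while answer quality on hard queries stays close to the large model's.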
Direct Supervision via Knowledge Distillation Loss in Weak-to-Strong Generalization
When a large, powerful computational model is trained using labels generated exclusively by a smaller, less accurate model, the performance of the large model on new, unseen data is fundamentally limited and cannot exceed the accuracy of the smaller model that provided the training labels.
Using Small Models for Pre-training or Fine-Tuning
Combining Small and Large Models
Learn After
Performance Gap Recovered (PGR)
Establishing a Performance Baseline
A research team is developing a powerful language model (a 'strong model') for a complex task. To guide its training, they first use a smaller, less capable model (a 'weak model'). They evaluate this weak model on a dedicated test set, where it achieves an accuracy of 72%. After the strong model is supervised by the weak model, the strong model achieves an accuracy of 85% on the same test set. In this scenario, what value represents the weak performance baseline (Pweak) used to measure the overall improvement?
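The arithmetic in the scenario, as a quick check (values taken directly from the question above):

```python
p_weak = 0.72  # Pweak: the weak model's accuracy on the test set (the baseline)
p_w2s = 0.85   # Pweak->strong: the strong model's accuracy under weak supervision

improvement = p_w2s - p_weak
print(f"baseline Pweak = {p_weak:.0%}, improvement over baseline = {improvement:.0%}")
```

The weak performance baseline is the 72% the weak model itself achieved; the strong model's 85% is measured against it, giving a 13-point improvement.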
The Role of a Baseline in Model Evaluation