1Cademy - State the rule for determining class distribution in balanced learning curve subsets.

Learn Before

Balanced Subsets for Noisy Learning Curves in Skewed or Many-Class Data

Short Answer

Question: When constructing small training subsets to plot a learning curve on skewed or many-class data, what rule should you follow to determine the class distribution of each subset?

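Sample answer: Choose each subset so that the fraction of examples from each class is as close as possible to that class's overall fraction in the original training set. A small subset drawn purely at random can under-represent a rare class or miss it entirely, which makes the learning curve noisy.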

Key points:

Do not select small subsets purely at random for skewed or many-class training data.
Keep subset class fractions as close as possible to the overall fractions in the original training set.

Rubric: The response must specify that the fraction or proportion of examples for each class in the subset should be as close as possible to the overall fraction in the original training set.
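
Illustrative sketch: the rule above can be expressed in a few lines of Python. This is a minimal sketch and not a prescribed method. The function name `balanced_subset`, its parameters, and the rounding choice are assumptions made for illustration: each class first receives the rounded-down share of its ideal count, and any remaining slots go to the classes with the largest fractional remainders.

```python
import random
from collections import defaultdict


def balanced_subset(labels, subset_size, seed=0):
    """Return indices of a subset whose class fractions track those of `labels`.

    Hypothetical helper for illustration; the rounding scheme is an assumption.
    """
    total = len(labels)
    if not 0 <= subset_size <= total:
        raise ValueError("subset_size must be between 0 and len(labels)")

    rng = random.Random(seed)

    # Group example indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Ideal (possibly fractional) number of examples per class.
    ideal = {c: subset_size * len(idxs) / total for c, idxs in by_class.items()}

    # Round down first, then hand leftover slots to the largest remainders.
    counts = {c: int(ideal[c]) for c in ideal}
    leftover = subset_size - sum(counts.values())
    by_remainder = sorted(ideal, key=lambda c: ideal[c] - counts[c], reverse=True)
    for c in by_remainder[:leftover]:
        counts[c] += 1

    # Sample the allotted number of examples from each class without replacement.
    chosen = []
    for c, idxs in by_class.items():
        chosen.extend(rng.sample(idxs, counts[c]))
    return chosen
```

For a training set with 90, 9, and 1 examples in three classes, a subset of size 20 under this sketch contains 18, 2, and 0 examples respectively. The rarest class is dropped because its ideal share is only 0.2 of an example; if every class must appear in each subset, a minimum of one example per class would have to be added as a separate design choice.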

Updated 2026-07-02
