Learn Before
Optimizing Features for a Medical Image Classifier
Case context: You are building a diagnostic model with a very limited training dataset of 500 patient records. You initially extract 10,000 features per record. The model suffers from high variance. Your colleague suggests either dropping just 500 features or performing a massive 10x reduction down to 1,000 features to solve the problem.
Question: As a practitioner, how should you evaluate your colleague's suggestions for feature reduction, and what approach is most justified given the dataset size?
Sample answer: Given the small training set, applying feature selection is highly justified and very useful for reducing variance. However, reducing the feature count slightly (dropping only 500 out of 10,000) is unlikely to have a huge effect on bias or significantly resolve the variance. A massive 10x reduction to 1,000 features will have a much more significant effect on variance, but you must be careful not to exclude too many useful features, as this could increase bias. Therefore, the significant 10x reduction is the better approach to tackle the high variance, provided useful features are carefully retained.
Key points:
- Feature selection is highly appropriate when the training dataset is small.
- A slight reduction in features is unlikely to impact bias or sufficiently fix variance.
- A significant reduction is more likely to have a substantial effect.
- Care must be taken during significant reductions to avoid discarding useful features and increasing bias.
Rubric: Award full credit if the response correctly identifies that feature selection is appropriate for a small dataset, notes that a small reduction won't have much effect, and explains that a significant reduction is needed but carries the risk of increasing bias if useful features are lost.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
What two opposing effects can feature selection have on a model's error components?
Reducing input features from 1,000 to 900 is unlikely to have a large effect on model bias.
In modern deep learning with plentiful data, practitioners are more likely to give _____ features to the algorithm and let it sort out which ones to use.
Match each feature reduction scenario to its likely impact on model bias according to Machine Learning Yearning.
Order the reasoning steps for deciding whether to apply feature selection as a variance-reduction technique.
According to Andrew Ng, under which specific condition is feature selection described as 'very useful'?
In modern deep learning with plentiful data, practitioners have largely shifted away from manual feature selection.
Reducing features from 1,000 to _____ is described as a ~10× reduction that is more likely to have a significant effect on bias.
Match each concept to its correct description in the context of feature selection for variance reduction.
Order the steps for evaluating how much a proposed feature reduction will affect model bias.
Trade-offs of Feature Selection in Deep Learning
Optimizing Features for a Medical Image Classifier
Impact of Plentiful Data on Feature Selection