Learn Before
Trade-offs of Feature Selection in Deep Learning
Question: Discuss the trade-offs of using feature selection to reduce variance in a machine learning model. How do the scale of the feature reduction and the size of the training dataset influence the decision to apply this technique?
Sample answer: Feature selection reduces variance by limiting the number or type of input features, but it risks increasing bias by removing potentially useful information. A small reduction in features typically has a minimal effect on bias, whereas a large reduction (e.g., 10x) significantly increases the risk of higher bias. In modern deep learning with plentiful data, it is generally preferred to feed all features into the algorithm and let it learn which ones matter. However, when the training set is small, feature selection remains a highly useful technique to prevent overfitting and manage variance.
Key points:
- Feature selection reduces variance but might increase bias.
- Small feature reductions are unlikely to cause large bias increases, unlike significant reductions.
- With plentiful data in modern deep learning, practitioners generally provide all features to the algorithm.
- Feature selection is very useful when the training set is small.
Rubric: Award full credit if the answer identifies the core trade-off (variance reduction vs. bias increase), explains the impact of reduction scale (small vs. large reductions), and correctly applies the context of dataset size (plentiful data favors using all features; small datasets favor feature selection).
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
What two opposing effects can feature selection have on a model's error components?
Reducing input features from 1,000 to 900 is unlikely to have a large effect on model bias.
In modern deep learning with plentiful data, practitioners are more likely to give _____ features to the algorithm and let it sort out which ones to use.
Match each feature reduction scenario to its likely impact on model bias according to Machine Learning Yearning.
Order the reasoning steps for deciding whether to apply feature selection as a variance-reduction technique.
According to Andrew Ng, under which specific condition is feature selection described as 'very useful'?
In modern deep learning with plentiful data, practitioners have largely shifted away from manual feature selection.
Reducing features from 1,000 to _____ is described as a ~10× reduction that is more likely to have a significant effect on bias.
Match each concept to its correct description in the context of feature selection for variance reduction.
Order the steps for evaluating how much a proposed feature reduction will affect model bias.
Trade-offs of Feature Selection in Deep Learning
Optimizing Features for a Medical Image Classifier
Impact of Plentiful Data on Feature Selection