The Role of Luck in Different Distributions
Question: Explain what "luck" refers to when training an algorithm on dataset A and testing it on a very different dataset B, and discuss its impact on the academic study of this topic.
Sample answer: When training an algorithm on dataset A and testing it on a completely different dataset B, "luck" refers to the researcher's hand-designed features tailored for the particular task, as well as other unknown or misunderstood factors. This reliance on luck can have a huge effect on how well the algorithm performs. Because the performance is so dependent on these unsystematic factors, it makes the academic study of training and testing on different distributions very difficult to carry out in a rigorous, systematic way.
Key points:
- Luck includes the researcher's hand-designed features.
- Luck includes other factors that are not yet understood.
- These factors can have a huge effect on algorithm performance on different data.
- This makes academic study difficult to carry out systematically.
Rubric: A strong response will define luck in this context (hand-designed features, unknown factors), state that it heavily affects performance, and conclude that this makes academic study difficult to conduct systematically.
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Applicability of Domain Adaptation
Gap Between Theory and Practice
Generalizing to Different Distributions _____
Concepts in Training on Different Distributions
The Challenge of Testing on Different Data
The Role of Luck in Different Distributions
Implementing Domain Adaptation in a Startup
Gap in Different Distribution Research
Factors Affecting Performance on Different Data
Widespread Use of Domain Adaptation