Choose the appropriate medical expert reference for a system with 40% error.
Case context: A machine learning team is developing an image recognition system for medical diagnostics. The system currently misclassifies 40% of the images. The team must choose between having the dataset labeled by a group of junior doctors who have a 10% error rate, or hiring a much more expensive panel of expert specialists who have a 5% error rate.
Question: Given the current 40% error rate of the medical image recognition system, which group of medical professionals should the team prioritize for labeling data and providing intuitions? Explain your decision based on the principles of human-level error reference.
Sample answer: The team should choose the junior doctors. Because the system's current error rate is very high at 40%, using a junior doctor with a 10% error rate versus an expert with a 5% error rate does not matter much. The gap between 40% and 10% provides more than enough room and sufficient tools to guide the improvement of the algorithm without needing the more precise 5% benchmark.
Key points:
- Diagnose the current system error as high (40%).
- Determine that the difference between 10% and 5% human error references is negligible at this stage.
- Decide that the less precise (junior doctor) reference is sufficient to provide tools for system improvement.
Rubric: A correct response identifies that the junior doctors are sufficient and explains that at a 40% system error rate, the difference between a 10% and 5% human-level reference is practically insignificant for providing intuitions.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Why Comparing to Human-Level Performance Helps ML Development
When a system has 40% error, which human-level reference is most useful for guiding improvement?
When a system's error rate is 40%, choosing a junior doctor (10% error) vs. an experienced doctor (5% error) as the human-level reference makes little practical difference.
If your system is already at _____ % error, defining the human-level reference as 2% gives better tools to keep improving.
Match each system error scenario to its implication for selecting a human-level reference.
Order the reasoning steps for deciding whether precision in the human-level reference matters for your system.
Why does a 2% human-level reference give better improvement tools when a system is at 10% error, compared to when it is at 40% error?
At high system error rates such as 40%, a more precise human-level reference always provides significantly better guidance than a less precise one.
According to Machine Learning Yearning, human error rate can be used as the _____ error rate when hoping for human-level performance.
Match each annotator type or system scenario to its description from Machine Learning Yearning.
Order the steps in the decision process for selecting an appropriate human-level error reference for an ML system.
Explain the relationship between current system error and the choice of human-level reference.
Choose the appropriate medical expert reference for a system with 40% error.
Benefit of a precise human-level reference for a low-error ML system.