What input restrictions apply when comparing a pipeline component to human performance?
Question: When evaluating a single pipeline component (e.g., 'Plan path' in a self-driving system) against human-level performance, what specific restriction must be placed on the information given to the human?
Sample answer: To properly evaluate the component, the human must be given only the exact same inputs that the specific pipeline component receives (such as the outputs from the previous components), rather than having access to the original raw data like camera images.
Key points:
- The human must receive the exact same input as the algorithm component.
- The human should be restricted to the previous components' outputs, not the raw data.
Rubric: The answer must specify that the human evaluator is restricted to using only the outputs from the preceding pipeline components, ensuring a fair comparison.
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Use the Best Achievable Human Team to Define Human-Level Performance
Measuring Human Error on Mobile-Image Examples
High Variance Indicated by 1% Training Error and 5% Training-Dev Error
Why is the optimal error rate for cat recognition nearly 0% according to Machine Learning Yearning?
If 14% of audio clips are too noisy for humans to understand, the optimal speech recognition error rate is approximately 14%.
Human-level performance is used as a proxy to estimate the _____ error rate on a given task.
Match each task scenario to the approximate optimal error rate implied by human-level performance.
Order the steps for using human-level performance to estimate optimal error rate and guide bias reduction.
An algorithm achieves 10% error on a task where humans achieve 2% error. What is the avoidable bias and what action does this suggest?
Human-level performance always equals 0% error, so the optimal error rate is always 0% for any machine learning task.
In the cat recognition example, because a human can recognize cats almost all the time, the ideal error rate is nearly _____.
Match each key term to its definition in the context of human-level performance as an optimal error rate proxy.
Order the reasoning steps for deciding whether a task's optimal error rate is near 0% or substantially higher.
Explain how comparing algorithm and human-level performance guides bias reduction.
What desired error rate should be set for a speech recognition system with 14% noisy data?
What input restrictions apply when comparing a pipeline component to human performance?