Evaluating a Self-Driving Path Planning Component
Case context: You are building a self-driving car pipeline. The car detection and pedestrian detection components output bounding boxes. The path planning component takes these bounding boxes and plans a trajectory. You want to informally debug the path planning component by comparing it to human-level performance. Your colleague proposes showing a human driver the raw front-facing camera videos to see if they would plan a better path.
Question: Based on the principles of informal pipeline debugging, explain why your colleague's proposal is flawed and how you should structure the human comparison instead.
Sample answer: My colleague's proposal is flawed because it allows the human driver to see raw camera videos. This gives the human access to rich visual features that the path planning component does not receive, such as road boundaries or un-detected obstacles. To structure the comparison correctly, the human should be given only the outputs of the car and pedestrian detection components (the bounding boxes/locations) and asked to plan a path using only that restricted information. This measures how well the planning component performs relative to a human given the same inputs.
Key points:
- Identifies that raw camera video provides extra information not available to the path planner.
- Explains that the human must plan a path using only the outputs from the detection components.
- States that the goal is to compare the component's performance to a human given identical inputs.
Rubric: Learners must identify that showing raw video provides the human with more information than the component receives. They must specify that the correct setup involves providing the human only with the outputs from the detection components.
0
1
References
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Machine Learning Yearning (Deeplearning.ai)
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Component Far from Human-Level Performance as Improvement Priority
Flawed Pipeline Despite Human-Level Components
When comparing the Plan path component to human-level performance in a self-driving pipeline, what inputs should the human evaluator receive?
ML Yearning describes comparing pipeline components to human-level performance as a rigorous, procedure-based debugging technique.
In informal pipeline debugging, you ask how _____ each component is from human-level performance.
Match each self-driving pipeline component from ML Yearning to its primary output or function.
Order the informal debugging questions ML Yearning recommends asking when evaluating a self-driving car pipeline.
Why does ML Yearning require the human evaluator for path planning to use only the detection components' outputs rather than camera images?
In ML Yearning's self-driving example, both the Detect cars and Detect pedestrians components feed their outputs directly into the Plan path component.
To compare the Plan path component fairly to a human, the human must plan the route using only the _____ of the detection components.
Match each informal debugging question to the pipeline element it targets in ML Yearning's self-driving car example.
Arrange the reasoning steps a practitioner follows when diagnosing a self-driving pipeline problem using human-level comparison.
Analyzing Pipeline Component Inputs for Human-Level Performance Comparison
Evaluating a Self-Driving Path Planning Component
Input Constraint in Human-Level Comparison for Path Planning