Learn Before
Evaluating dev set representativeness for a multi-region deployment.
Case context: A machine learning team is developing an application that needs to perform well across four distinct geographic regions. However, they are considering building their dev set using readily available data from only two of those regions for convenience.
Question: If the team proceeds with a dev set comprising data from only two regions, what will their optimization efforts focus on, and why? What must they do to align their efforts with their true deployment goal?
Sample answer: If the team uses a dev set from only two regions, their optimization efforts will focus solely on improving performance in those two regions, because a team's primary focus inherently becomes improving dev set performance. To align with their true goal of doing well across all four geographies, they must ensure their dev set reflects that broader task by including representative data from all four regions.
Key points:
- Optimization naturally centers entirely on the existing dev set.
- The proposed dev set leads to ignoring two critical target regions.
- The dev set must be expanded to include data from all four regions to reflect the true task.
Rubric: The response should state that the team will end up optimizing only for the two regions present in the dev set, and recommend redesigning the dev set to include data from all four target regions.
0
1
Tags
Machine Learning
Deep Learning
Machine Learning Strategy
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Yearning @ DeepLearning.AI
Related
Why should the dev set reflect the task you most want to improve on?
After defining dev and test sets, a team's primary optimization focus becomes improving dev set performance.
The dev set should reflect the _____ the team most wants to improve on.
Match each dataset or concept to its role in the ML Yearning framework for dev and test sets.
Order the steps in the reasoning process for ensuring a dev set reflects the target task.
A team wants performance across four geographies but their dev set covers only two. What is the likely consequence?
A dev set covering only a subset of target deployment scenarios will cause the team to optimize for an incomplete version of the task.
Once you define the dev and test sets, your team will be focused on improving _____ performance.
Match each dev set design choice to the optimization outcome it produces.
Order the consequences that unfold when a dev set does NOT reflect the full target task.
Explain the relationship between dev set composition and optimization focus.
Evaluating dev set representativeness for a multi-region deployment.
Identify the primary criterion for a dev set's composition.