Learn Before
Propose an averaging strategy to combine classification accuracy metrics from US, China, India, and other markets into a single evaluation metric.
Case context: Your machine learning team is developing a cat classifier and separately tracking its accuracy across four markets: US, China, India, and Other. The product manager notes that the China market has ten times the active user base of the US market, yet the US market generates the highest revenue per user. The team needs to evaluate model iterations using a single-number evaluation metric.
Question: Based on the provided context, decide whether the team should use a simple average or a weighted average to combine these four market metrics, and justify how they should construct the single formula to guide their optimization process.
Sample answer: The team should use a weighted average rather than a simple average to combine the four market metrics. Because the China and US markets have different user volumes and financial values, a simple average would treat all regions equally, leading to suboptimal model selection. By using a weighted average, the team can assign mathematical weights to each region's accuracy metric (for example, weighting by user volume or revenue contributions) to create a single-number evaluation metric that reflects their combined strategic priorities.
Key points:
- Recommend a weighted average over a simple average to combine the four regional metrics.
- Explain that simple averaging fails to account for differences in user volume or revenue value among markets.
- Propose assigning weights to each of the four markets based on business value, traffic, or strategic importance.
- Create a single-number evaluation metric formula that combines all four numbers to guide optimization.
Rubric: The response must recommend a weighted average, explain why a simple average is inadequate given the market disparities, and describe how weights should be assigned based on strategic factors like user volume or revenue.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
When a cat classifier's accuracy is tracked across four regional markets, which method does Andrew Ng recommend for combining these into a single-number metric?
True or False: Taking an average or weighted average of multiple accuracy metrics is one of the most common ways to combine them into a single-number metric.
By taking an average or weighted average of accuracy metrics across four regional markets, you end up with a _____ metric.
What does taking an average of accuracy scores from four key markets produce?
Taking an average or weighted average of multiple metrics is one of the most common ways to combine them into a single number metric.
Tracking your cat classifier's accuracy separately in four key markets gives you _____ metrics before combining.
Match each combining strategy or concept to its correct description.
Order the steps for converting four market accuracy scores into a single evaluation metric.
Why might a team prefer a weighted average over a simple average when combining market accuracy metrics?
In Andrew Ng's four-market example, each of the four regions contributes exactly one accuracy metric.
Taking an average or weighted average of multiple metrics is one of the most _____ ways to combine them into a single number.
Match each concept to its role in creating a single-number evaluation metric from multiple market scores.
Order the reasoning steps a team follows when deciding to combine multiple market metrics via averaging.
Compare the impacts of simple and weighted averaging of regional accuracies on a classifier's evaluation metric.
Propose an averaging strategy to combine classification accuracy metrics from US, China, India, and other markets into a single evaluation metric.
Explain the core benefit of combining market-specific accuracy scores into a single-number evaluation metric.