Analyze the efficacy and practical limitations of adding a source indicator feature to resolve inconsistent auxiliary data.
Question: Explain how adding an extra feature indicating the data source (such as the city in a housing-price model) can resolve data inconsistency, and analyze the practical limitations of this approach according to the source material.
Sample answer: Adding a source indicator feature (such as the city) to each training example modifies the input x to explicitly denote the data origin. This resolves the inconsistency because given the specified input x, the target value y becomes unambiguous. However, a major practical limitation is that this approach is not frequently used or observed in practice.
Key points:
- Adding a source indicator feature (e.g., city) directly to each training example input x.
- The addition makes the target value y unambiguous for any given input x.
- In practical applications, this approach is not frequently observed or implemented.
Rubric: The response must explain that specifying the source in the input x makes the target value y unambiguous, thereby resolving the inconsistency. It must also identify that a practical limitation is its low frequency of use in real-world applications.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Related
What is the primary effect of adding a city indicator feature to training examples that mix Detroit and New York City housing data?
True or False: Andrew Ng reports that adding a source indicator feature to resolve inconsistent auxiliary data is a frequently used technique in practice.
When a city indicator is added as a feature to each training example, the target value y becomes _____ given the input x.
What can be added to each training example to resolve inconsistencies when data comes from multiple sources?
Adding a source indicator feature (e.g., city) to input x makes the target value y unambiguous.
Given an input x that specifies the city, the target value of _____ is now unambiguous.
Match each concept in the source indicator approach to its correct description.
Order the steps to apply the source indicator feature approach to inconsistent training data.
How does Andrew Ng characterize the practical adoption of the source indicator feature approach for inconsistent data?
Andrew Ng presents the source indicator feature approach as a frequently used best practice for handling inconsistent data.
In the housing-price example, adding a feature indicating the _____ to each training example can resolve inconsistencies across data sources.
Match each scenario to the role it plays in the source indicator feature approach.
Order the reasoning steps Andrew Ng uses to introduce and assess the source indicator feature approach.
Analyze the efficacy and practical limitations of adding a source indicator feature to resolve inconsistent auxiliary data.
Evaluate the use of a source indicator for resolving inconsistencies in a multi-city housing model.
Explain the theoretical effect of specifying the city in a housing-price input when combining inconsistent data sources.