Case Study

Evaluating Anti-Spam Feature Directions: The Server Routing Approach

Case context: Your machine learning team is brainstorming new features to improve an anti-spam system. The current system only looks at the text in the email body, but sophisticated spammers have started evading these content-based filters by constantly altering their wording.

Question: Based on the team's brainstorming directions, propose a specific non-content feature approach using the email's metadata. What information should you extract, and where do you find it?

Sample answer: The team should develop features from the email envelope or header. Specifically, they should extract information showing the set of internet servers the message passed through. This provides a strong signal based on the email's routing history rather than its text.

Key points:

  • Propose using the email envelope or header.
  • Extract data regarding the internet servers the message passed through.
  • Recognize this as a non-content, metadata-driven anti-spam direction.

Rubric: A correct response must identify the email envelope or header as the source and specify that the target information is the set of internet servers the message went through.

0

1

Updated 2026-05-27

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Machine Learning Strategy

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Yearning @ DeepLearning.AI