Concept

DBSCAN Clustering

  • Density based spatial clustering of applications with noise
  • Don’t need to specify # of clusters
  • Relatively efficient -Identifies likely noise points
  • Parameters: - Neighbor count parameter: min_samples - Neighborhood radius parameter: eps
  • Core point: point that lie in more dense region
  • For a given data point, if there are min_sample other data points that lie within a distance of eps, that given data point is labelled as a core sample
  • All core samples with a distance of eps units apart are put in the same cluster
  • Noise: points that don’t belong in any cluster
  • Boundary point: points within a distance of eps units from core points but not core points themselves
Image 0

0

1

Updated 2021-02-20

Tags

Data Science