DBSCAN algorithm and clustering algorithm for data mining

0 votes
How do you implement DBSCAN algorithm on categorical data (mushroom data set)?

And what is a one pass clustering algorithm?
Mar 3, 2022 in Machine Learning by Nandini
• 5,480 points
840 views

1 answer to this question.

0 votes
You can use any distance function with DBSCAN without making any changes. Because the indexing will be more difficult, the complexity will most likely be O(n2).

DBSCAN, on the other hand, only computes distances, compares them to a threshold, and counts things if you look closely. This is a significant strength of it; all you need to do is design a distance function and thresholds to apply it to a variety of data types.
DBSCAN is a pairwise distance-based algorithm, hence I doubt there is a one-pass variant. Some of these computations can be pruned (this is where the index comes in), but you still need to compare each item to each other, so it's O(n log n) and not one-pass.

The original k-means algorithm, I believe, was a one-pass algorithm. Your first means consist of the first k things. You choose the nearest mean for each new object and update it (incrementally) with the new object for each new object. This was a "one-pass" process as long as you didn't iterate over your data set again. (However, the outcome will be worse than lloyd-style k-means.)
answered Mar 4, 2022 by Dev
• 6,000 points

Related Questions In Machine Learning

0 votes
1 answer
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer
0 votes
1 answer

How to find out cluster center mean of DBSCAN in R?

Just index back into the original data ...READ MORE

answered Jun 27, 2018 in Data Analytics by Sahiti
• 6,370 points
1,194 views
0 votes
1 answer

Plot a k-distance graph in python

You probably want to use the matrix ...READ MORE

answered Sep 5, 2018 in Python by Priyaj
• 58,090 points
4,691 views
0 votes
1 answer

hclust size limit?

Classic hierarchical clustering approaches are O(n^3) in ...READ MORE

answered Jul 10, 2019 in Python by SDeb
• 13,300 points
475 views
0 votes
1 answer

Hierarchical clustering of 1 million objects

Consider switching the algorithm instead of using ...READ MORE

answered Feb 24, 2022 in Machine Learning by Nandini
• 5,480 points
346 views
0 votes
1 answer

Orange Data Mining and Logistic Regression

A statistical classification method that fits data ...READ MORE

answered Mar 15, 2022 in Machine Learning by Dev
• 6,000 points
1,148 views
0 votes
1 answer

Minimum support and minimum confidence in Data Mining

Yes, there is a mechanism for calculating ...READ MORE

answered Apr 4, 2022 in Machine Learning by Dev
• 6,000 points
2,742 views
webinar REGISTER FOR FREE WEBINAR X
REGISTER NOW
webinar_success Thank you for registering Join Edureka Meetup community for 100+ Free Webinars each month JOIN MEETUP GROUP