K Means using elbow method

0 votes
How to use elbow method to find the best no of clusters to use K Means?
Aug 25, 2019 in Data Analytics by praneel
953 views

1 answer to this question.

0 votes

Elbow method allows the user to know the best fit number of clusters.

Follow the below steps:

  1. Compute clustering algorithm (e.g., k-means clustering) for different values of k. For instance, by varying k from 1 to 10 clusters.
  2. For each k, calculate the tot.withinss.
  3. Plot the curve of above values against the number of clusters from step 1.
  4. The value at the bend of the plot is considered as the best fit value for no of clusters.
totwss=sapply(1:10, function(k) { kmeans(mtcars$mpg,k)$tot.withinss})
k = data.frame(k = 1:6,totwss = totwss)
ggplot(k,aes(k,totwss))+geom_line()

In this case, you can take 2 or 3 as per your choice.

answered Aug 26, 2019 by anonymous
• 32,490 points

Related Questions In Data Analytics

+1 vote
1 answer

Join using two mappers - invalid inputfile exception

Dear Learner, Hope you are doing well. Can you ...READ MORE

answered Dec 18, 2017 in Data Analytics by Sudhir
• 1,610 points
77 views
0 votes
1 answer

Using "dplyr" to summarise multiple columns

You can use the "sumamrise_all()" function for ...READ MORE

answered Mar 26, 2018 in Data Analytics by Bharani
• 4,560 points
863 views
0 votes
1 answer

Left Join and Right Join using "dplyr"

The below is the code to perform ...READ MORE

answered Mar 26, 2018 in Data Analytics by Bharani
• 4,560 points
262 views
0 votes
1 answer

How to change y axis max in time series using R?

The axis limits are being set using ...READ MORE

answered Apr 3, 2018 in Data Analytics by Sahiti
• 6,320 points
898 views
0 votes
1 answer

Use different distance formula other than euclidean distance in k means

K-means is based on variance minimization. The sum-of-variance formula ...READ MORE

answered Jun 21, 2018 in Data Analytics by Sahiti
• 6,320 points
638 views
+1 vote
1 answer

k means vs KNN

K-means clustering is basically an unsupervised clustering ...READ MORE

answered Oct 30, 2018 in Data Analytics by kurt_cobain
• 9,320 points
314 views
0 votes
1 answer

Big Data transformations with R

Dear Koushik, Hope you are doing great. You can ...READ MORE

answered Dec 17, 2017 in Data Analytics by Sudhir
• 1,610 points
182 views
0 votes
2 answers

Transforming a key/value string into distinct rows in R

We would start off by loading the ...READ MORE

answered Mar 26, 2018 in Data Analytics by Bharani
• 4,560 points
216 views
0 votes
2 answers

How to change the value of a variable using R programming in a data frame?

You are getting this error since the ...READ MORE

answered Aug 16, 2019 in Data Analytics by anonymous
• 32,490 points
13,268 views
0 votes
1 answer

How to edit the labels and limit if a plot using ggplot? - R

Add a limit to axis ticks using ...READ MORE

answered Nov 3, 2019 in Data Analytics by anonymous
• 32,490 points
70 views