1,090 views
1 1 vote
how to choose the number of clusters based on intertia and based on silhouette using K-means
0% Accept Rate Accepted 0 answers out of 8 questions

1 Answer

0 0 votes
Having these different approaches to determine a convenient value of k (number of clusters) should help to narrow down the options for $k$, however, we cannot make sure the value we select finally is the best or not. For example, if silhouette measure is closer to 1 when $k=5$ and for Inertia we have the "elbow" or "knee" around $k=4$ to $k= 6$, probably one of the values of 4, 5, or 6 might be a good value for the number of clusters. These two methods together help us have some estimations for the number of clusters but they do not guarantee it.

Related questions

2 2 votes
1 1 answer
908
908 views
2 2 votes
1 answers 1 answer
20.9k
20.9k views
tofighi asked Jun 26, 2019
20,913 views
Use the k-means algorithm and Euclidean distance to cluster the following 8 examples into 3 clusters:$A1=(2,10), A2=(2,5), A3=(8,4), A4=(5,8), A5=(7,5), A6=(6,4), A7=(1,2...
1 1 vote
1 1 answer
917
917 views
DavidS asked Oct 17, 2018
917 views
PCAhttps://www.youtube.com/watch?v=FgakZw6K1QQK-Meanshttps://www.youtube.com/watch?v=4b5d3muPQmA&index=10&list=PLblh5JKOoLUICTaGLRoHQDuF_7q2GfuJF&t=0s
3 3 votes
0 0 answers
533
533 views
0 0 votes
0 0 answers
1.5k
1.5k views
Frenzy asked Apr 27, 2022
1,522 views
I have a dataset with 7 labels in the target variable.X = data.drop('target', axis=1) Y = data['target'] Y.unique()array(['Normal_Weight', 'Overweight_Level_I', 'Overweig...