Information for hyper-parameter tuning code. More...
Detailed Description
The Silhouette Score is a metric of performance for clustering that represents the quality of clusters made as a result.
It provides an indication of goodness of fit and therefore a measure of how well unseen samples are likely to be predicted by the model, considering the inter-cluster and intra-cluster dissimilarities. Silhoutte Score is dependent on the metric used to calculate the dissimilarities. The best possible score is . Smaller values of Silhouette Score indicate poor clustering. Negative values would occur when a wrong label was put on the element. Values near zero indicate overlapping clusters. For an element i is within cluster average dissimilarity and is minimum of average dissimilarity from other clusters. the Silhouette Score of a Sample is calculated by
The Overall Silhouette Score is the mean of individual silhoutte scores.