Interpret the key results for Cluster K-Means

Complete the following steps to interpret a cluster k-means analysis. Key output includes the observations and the variability measures for the clusters in the final partition.

Step 1: Examine the final groupings

Examine the final groupings to see whether the clusters in the final partition make intuitive sense, based on the initial partition you specified. Check that the number of observations in each cluster satisfies your grouping objectives. If one cluster contains too few or too many observations, you may want to re-run the analysis using another initial partition.

Method

Number of clusters3
Standardized variablesYes

Final Partition

Number of
observations
Within
cluster
sum of
squares
Average
distance
from
centroid
Maximum
distance
from
centroid
         
Cluster141.5930.5780.884
Cluster288.7360.9641.656
Cluster31012.9211.0931.463

Cluster Centroids

VariableCluster1Cluster2Cluster3Grand
centroid
         
Clients1.23180.5225-0.91080.0000
Rate of Return1.29420.2217-0.69500.0000
Sales1.18660.5157-0.88720.0000
Years1.20300.5479-0.91950.0000

Distances Between Cluster Centroids

Cluster1Cluster2Cluster3
       
Cluster10.00001.59154.1658
Cluster21.59150.00002.6488
Cluster34.16582.64880.0000
Key Results: Final partition

In these results, Minitab clusters data for 22 companies into 3 clusters based on the initial partition that was specified. Cluster 1 contains 4 observations and represents larger, established companies. Cluster 2 contains 8 observations and represents mid-growth companies. Cluster 3 contains 10 observations and represents young companies. A business analyst believes that these final groupings are adequate for the data.

Note

To see which cluster each observation belongs to, you must enter a storage column when you perform the analysis. Minitab stores the cluster membership for each observation in a column in the worksheet.

Step 2: Assess the variability within each cluster

Examine the variability of the observations within each cluster, using the distance from centroid measures. Clusters with higher values exhibit greater variability of the observations within the cluster. If the difference in variability between clusters is too high, you may want to re-run the analysis using another initial partition.

Method

Number of clusters3
Standardized variablesYes

Final Partition

Number of
observations
Within
cluster
sum of
squares
Average
distance
from
centroid
Maximum
distance
from
centroid
         
Cluster141.5930.5780.884
Cluster288.7360.9641.656
Cluster31012.9211.0931.463

Cluster Centroids

VariableCluster1Cluster2Cluster3Grand
centroid
         
Clients1.23180.5225-0.91080.0000
Rate of Return1.29420.2217-0.69500.0000
Sales1.18660.5157-0.88720.0000
Years1.20300.5479-0.91950.0000

Distances Between Cluster Centroids

Cluster1Cluster2Cluster3
       
Cluster10.00001.59154.1658
Cluster21.59150.00002.6488
Cluster34.16582.64880.0000
Key Results: Average distance from centroid

In these results, the average distance from centroid is lowest for Cluster 1 (0.578) and highest for Cluster 3 (1.093). This indicates that Cluster 1 has the least variability and Cluster 3 has the most variability. However, Cluster 1 has the fewest observations (4) and Cluster 3 has the most observations (10), which may partly explain the difference in variability.