K-means clustering
K-means clustering
Prototype-based clustering
• Each cluster represented by a prototype
• Other names: centroid clustering, centre-based clustering
• Example: customer segmentation. Each segment has a prototype customer and
customers similar to him/her are associated with that cluster.
Centroid - real/imaginary data point with mean characteristics of all the data points
within the cluster
K-means clustering - how to?
Why use k-means?
Strengths:
K-means weaknesses
Hierarchical clustering
* Output: a hierarchy of potential clusterings - dendrogram.
Why use hierarchical clustering?
Strengths:
• Clusters can be of any size and shape.
• Does not require to prespecify the number of clusters.
Hierarchical clustering weaknesses
* Computationally inefficient.