reason for clustering
categorize data without labels
cluster searching
** Note: much faster than searching each document
k-means training
responsibility
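
A minimal sketch of k-means training, for illustration only. It assumes squared Euclidean distance, centers initialized from randomly sampled data points, and hard responsibilities (each point belongs entirely to its nearest center). The function name `kmeans` and its parameters are hypothetical, not taken from these notes.

```python
import random


def kmeans(points, k, iters=100, seed=0):
    """Plain k-means on a list of equal-length tuples.

    Returns (centers, labels). Assumes squared Euclidean distance.
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # assumption: initialize from data points
    labels = [0] * len(points)

    for _ in range(iters):
        # Assignment step: hard responsibility, each point is assigned
        # to the single nearest center.
        labels = [
            min(
                range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centers[j])),
            )
            for p in points
        ]

        # Update step: move each center to the mean of its assigned points.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                mean = tuple(sum(col) / len(members) for col in zip(*members))
                new_centers.append(mean)
            else:
                new_centers.append(centers[j])  # keep the center of an empty cluster

        if new_centers == centers:  # centers stopped moving: converged
            break
        centers = new_centers

    return centers, labels
```

Usage: `centers, labels = kmeans([(0, 0), (0, 1), (10, 10), (10, 11)], k=2)`. The result depends on the initialization, so a different seed can give a different local minimum on harder data.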

fuzzy k-means (soft k-means) algorithm
k-means cost function
where k-means fails
k-means problems
ideal k value
hierarchical (agglomerative) clustering
joining clusters
dendrogram

distance equations

valid distance metrics
self-organizing maps
gaussian mixture model (GMM)
GMM algorithm

responsibility k-means v. GMM
independent component analysis (ICA)
benefits of GMM over fuzzy k-means
GMM v. fuzzy k-means

singular covariance problem
ways of dealing with singular covariance problem
diagonal covariance
