What is unsupervised learning
collects similar data into clusters (they get names cluster A etc but this has no semantic meaning)
What is the process of clustering
grouping similar objects into groups
What are example applications of clustering
social network analysis or marketing
Image segmentation
What are some unsupervised learning algorithms
clustering concepts
Partition-based clustering algorithms (k-means)
Hierarchical clustering (agglomerative clustering)
What is the aim of clustering algorithms
to see whether the data fall into distinct groups, with members within each
group being similar to other members in that group but different from members of
other groups
What are the steps of the k-means algorithm
What are k-means variations
What are the k-means stengths
simple and easy to implement
quite efficient
What are the weaknesses with kmeans
Why is k-means sensitive to the noise
since we are using the mean
Explain hte agglomerative clustering algorithm
WHat are the strengths of agglomerative clustering
WHat are the weakensses of agglomerative clustering
does not scale up for large data sets