What is the one line definition of Unsupervised Learning?
A type of algorithm that learns hidden patterns from unlabelled data
What are two primary aspects of Unsupervised Learning?
Cluster Analysis - Divide data into meaningful groups
Dimensionality Reduction - PCA, Autoencoders, Generative Models
What are the 4 key aspects of Cluster Analysis?
What 4 common Similarity Functions are in Unsupervised Learning?
Euclidean Distance
Cosine Distance
Manhattan Distance
Jaccard Distance
What is the equation for Euclidean Distance?
Euclidean Distance = Square Root of ((A - B)^T * (A - B))
Where:
A, B: Column Vectors that contain features of two data samples
What is the equation for Cosine Distance?
Cosine Distance: cos(theta) = (A * B) / (||A|| * ||B||)
Where:
A, B: Column Vectors that contain features of two data samples
What is the equation for the Manhattan Distance?
Manhattan Distance = 1/n (n sigma) |A - B|
Where:
A, B: Column Vectors that contain features of two data samples
What is the equation for the Jaccard Distance?
Jaccard Distance = |A intersection B| / |A union B|
Where:
A, B: Column Vectors that contain features of two data samples
What is the step-by-step method of K-Means Algorithm?
What are the advantages of K-Means Algorithm?
Simple
Efficient
What are the disadvantages of K-Means Algorithm?
Solution dependent on the initialisation
Need to specify number of clusters
Sensitive to Outliers
What is the step-by-step method of Agglomerative Hierarchical Clustering?
What are the advantages of Agglomerative Hierarchical Clustering?
Flexible with number of clusters
Can capture hierarchical relationship/s
What are the disadvantages of Agglomerative Hierarchical Clustering?
Solution is local optimum, dependent on subject functions e.g. minimum, maximum, group average, etc…
Requires larger memory and longer computational time
What is the step-by-step method for Density-Based Spatial Clustering of Applications with Noise (DBSCAN)?
What are the advantages of DBSCAN?
Robust to outliers
Learn non-regular population density patterns
Automatically determine the number of clusters
What are the disadvantages of DBSCAN?
Not robust to variable density clusters (due to single E being used)
Computationally expensive
Sensitive to parameter settings
What is the step-by-step method for Expectation Maximisation?
What are the advantages of Expectation Maximisation?
Soft clustering
What are the disadvantages of Expectation Maximisation?
Restricted by the distribution model
Sensitive to initialisation
Need to specificity the number of clusters
What is a sign of a good Clustering Analysis?
If the clustering algorithm separates dissimilar data samples apart and similar data samples together, then it has performed well.