What is t-SNE?
links:
https: //lvdmaaten.github.io/publications/papers/JMLR_2014.pdf
https: //www.youtube.com/watch?v=RJVL80Gg3lA
embedding technique commonly used for visualizing high-dim data in scatter plots
similarity in stochastic neighbor embedding (SNE)
in SNE one computes a NxN similarity matrix in both the original and the low-dim data spaces; each matrix forms a probability distribution over pair of objects:
probability distributions in SNE
represented usually by normalized Gaussian or Student-t kernel
learning the SNE
low-dim embedding is learned by minimizing the KL between the 2 distributions wrt to the locations of the points in the embedding