1Cademy - Overview of working of T-SNE

Learn Before

T-Distributed Stochastic Neighbour Embedding (T-SNE)

Concept

Overview of how t-SNE works

The algorithm starts by converting the distances between points in the high-dimensional space into similarity probabilities, and does the same for the corresponding points in the low-dimensional space. In the high-dimensional space, the similarity of point B to point A is the conditional probability that A would choose B as its neighbor if neighbors were picked in proportion to their probability density under a Gaussian (normal distribution) centered at A. In the low-dimensional space, similarities are computed with a Student t-distribution (with one degree of freedom) instead of a Gaussian; this heavy-tailed distribution is where the "t" in t-SNE comes from, and it is unrelated to the t-test. The algorithm then tries to minimize the mismatch between the two sets of similarities, so that the low-dimensional points represent the original data as faithfully as possible. The mismatch is measured by the Kullback-Leibler divergence between the two distributions, summed over all pairs of data points, and t-SNE minimizes it using gradient descent.
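The steps described above can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation: the function names (`high_dim_affinities`, `low_dim_affinities`, `kl_divergence`, `tsne_sketch`) are hypothetical, and it assumes a single fixed Gaussian bandwidth `sigma` for every point, whereas the full algorithm tunes a separate bandwidth per point from a user-chosen perplexity. It also uses plain gradient descent without momentum or early exaggeration.

```python
import numpy as np

def pairwise_sq_dists(X):
    # Squared Euclidean distance between every pair of rows of X.
    diff = X[:, None, :] - X[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def high_dim_affinities(X, sigma=1.0):
    # Conditional probabilities p(j|i) under a Gaussian centred at point i.
    # Assumption: one fixed sigma for all points (no perplexity search).
    logits = -pairwise_sq_dists(X) / (2.0 * sigma ** 2)
    np.fill_diagonal(logits, -np.inf)            # a point is not its own neighbour
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    P_cond = np.exp(logits)
    P_cond /= P_cond.sum(axis=1, keepdims=True)
    # Symmetrise into joint probabilities that sum to 1 over all pairs.
    return (P_cond + P_cond.T) / (2.0 * X.shape[0])

def low_dim_affinities(Y):
    # Joint probabilities q_ij from a Student t-distribution with 1 degree of freedom.
    W = 1.0 / (1.0 + pairwise_sq_dists(Y))
    np.fill_diagonal(W, 0.0)
    return W / W.sum(), W

def kl_divergence(P, Q, eps=1e-12):
    # KL(P || Q), summed over all pairs with non-zero p_ij.
    mask = P > 0
    return float(np.sum(P[mask] * np.log(P[mask] / np.maximum(Q[mask], eps))))

def tsne_sketch(X, n_components=2, n_iter=300, learning_rate=1.0, sigma=1.0, seed=0):
    rng = np.random.default_rng(seed)
    P = high_dim_affinities(X, sigma)
    Y = rng.normal(scale=1e-2, size=(X.shape[0], n_components))  # small random start
    history = []
    for _ in range(n_iter):
        Q, W = low_dim_affinities(Y)
        history.append(kl_divergence(P, Q))
        # Gradient w.r.t. y_i: 4 * sum_j (p_ij - q_ij) * w_ij * (y_i - y_j)
        G = (P - Q) * W
        grad = 4.0 * (G.sum(axis=1, keepdims=True) * Y - G @ Y)
        Y -= learning_rate * grad                # plain gradient descent step
    return Y, history
```

Calling `tsne_sketch(X)` on an array `X` of shape `(n_points, n_features)` returns the low-dimensional coordinates together with the KL divergence recorded at each iteration, which makes it easy to check that the mismatch between the two similarity distributions is shrinking.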

Updated 2021-02-20

Contributors are:

Miaowei Wang

Who are from:

University of Michigan - Ann Arbor

References

Visualizing Data using t-SNE
