Data Science

Similarities are computed from distances using a kernel different from the t-SNE kernel
- i.e. it is not Gaussian
- Similar to t-SNE, it decays exponentially and has adaptive width
Similarities are not normalized to sum up to $1$
- However, the similarity values still end up being normalized to sum up to a constant value
Similarity values are symmetrized

UMAP and t-SNE are both useful for visualizations
UMAP is faster compared to t-SNE
- UMAP can complete embedding in less than a minute on 70k samples with 784 features
- t-SNE completes embedding in around 45 minutes on 70k samples with 785 features
Similar results with random and informative initialization:
- With random and informative initialization, t-SNE and UMAP both are able to preserve local structures
- With random initialization, t-SNE and UMAP both struggle to preserve global structures
- With informative random initialization, t-SNE and UMAP both are able to preserve global structures

t-SNE