Linear Digressions

Unsupervised Dimensionality Reduction: UMAP vs t-SNE



Dimensionality reduction redux: this episode covers UMAP, an unsupervised algorithm designed to make high-dimensional data easier to visualize and cluster. It’s similar to t-SNE but has some advantages. The episode gives a quick recap of t-SNE, especially its connection to information theory, then gets into how UMAP is different (and, many say, better).
Between the time we recorded and released this episode, an interesting argument made the rounds on the internet: that UMAP’s advantages largely stem from good initialization, not from anything inherent in the algorithm. We don’t cover that argument here because it wasn’t out yet when we were recording, but you can find a link to the paper below.
Relevant links:
https://pair-code.github.io/understanding-umap/
https://www.biorxiv.org/content/10.1101/2019.12.19.877522v1
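
For readers who want something concrete on the information-theory connection mentioned above: t-SNE's objective is a Kullback-Leibler (KL) divergence between neighbor-similarity distributions in the original space and in the low-dimensional embedding. The sketch below is not t-SNE itself, only the divergence it minimizes, computed on made-up numbers. The function name `kl_divergence` and the three example distributions are assumptions for illustration and do not come from the episode.

```python
import math


def kl_divergence(p, q):
    """Return KL(P || Q) in nats for two discrete distributions.

    Both inputs are sequences of probabilities over the same items.
    Terms with p_i == 0 contribute nothing, by the usual convention.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Hypothetical similarities of one point to three neighbors.
p_high = [0.7, 0.2, 0.1]  # in the original high-dimensional space
q_good = [0.6, 0.3, 0.1]  # an embedding that roughly preserves the neighbors
q_bad = [0.1, 0.2, 0.7]   # an embedding that swaps near and far neighbors

print("faithful embedding: ", kl_divergence(p_high, q_good))
print("distorted embedding:", kl_divergence(p_high, q_bad))
```

The distorted embedding yields the larger divergence, which is the sense in which minimizing KL divergence pushes an embedding to keep close neighbors close.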

Linear Digressions, by Ben Jaffe and Katie Malone

Rating: 4.8 (352 ratings)

