Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualization similarly to t-SNE, but also for general non-linear dimension reduction. It is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low-dimensional projection of the data that has the closest possible equivalent fuzzy topological structure. First of all UMAP is fast. It can handle large datasets and high dimensional data without too much difficulty, scaling beyond what most t-SNE packages can manage. This includes very high dimensional sparse datasets. UMAP has successfully been used directly on data with over a million dimensions. Second, UMAP scales well in the embedding dimension—it isn't just for visualization. You can use UMAP as a general-purpose dimension reduction technique as a preliminary step to other machine learning tasks.

Features

  • The data is uniformly distributed on a Riemannian manifold
  • Documentation available
  • The Riemannian metric is locally constant (or can be approximated as such)
  • The manifold is locally connected
  • UMAP depends upon scikit-learn, and thus scikit-learn's dependencies
  • Examples included

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

BSD License

Follow UMAP

UMAP Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of UMAP!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2024-07-31