Uniform Manifold Approximation and Projection (UMAP) is a dimension reduction technique that can be used for visualization similarly to t-SNE, but also for general non-linear dimension reduction. It is possible to model the manifold with a fuzzy topological structure. The embedding is found by searching for a low-dimensional projection of the data that has the closest possible equivalent fuzzy topological structure. First of all UMAP is fast. It can handle large datasets and high dimensional data without too much difficulty, scaling beyond what most t-SNE packages can manage. This includes very high dimensional sparse datasets. UMAP has successfully been used directly on data with over a million dimensions. Second, UMAP scales well in the embedding dimension—it isn't just for visualization. You can use UMAP as a general-purpose dimension reduction technique as a preliminary step to other machine learning tasks.

Features

  • The data is uniformly distributed on a Riemannian manifold
  • Documentation available
  • The Riemannian metric is locally constant (or can be approximated as such)
  • The manifold is locally connected
  • UMAP depends upon scikit-learn, and thus scikit-learn's dependencies
  • Examples included

Project Samples

Project Activity

See All Activity >

Categories

Machine Learning

License

BSD License

Follow UMAP

UMAP Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of UMAP!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Machine Learning Software

Registered

2024-07-31