The goals of the project are described on the web page https://lakhcleananalysis.sourceforge.io/
and in the YouTube video referenced here.
Presently, we are analyzing the note onset distribution, the pitch class distribution, and the MIDI program assignments for the entire dataset. Each of these features is represented by a separate vector for each MIDI file. These vectors were clustered using the k-means and HDBSCAN algorithms. The vectors (one per MIDI file) were then projected into a two-dimensional space using the UMAP algorithm. A user interface, umapPlot.tcl, is provided to explore this space.
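To make the idea of a per-file feature vector concrete, here is a minimal sketch of how a pitch class distribution might be computed. It assumes the vector is a normalized 12-bin histogram of MIDI note numbers taken modulo 12; the project's actual extraction code is not shown here and may differ. The function name `pitch_class_distribution` and the sample notes are hypothetical.

```python
from collections import Counter


def pitch_class_distribution(note_numbers):
    """Return a 12-element vector of relative pitch class frequencies.

    note_numbers: iterable of MIDI note numbers (0-127).
    The pitch class of a note is its note number modulo 12 (0 = C, 1 = C#, ...).
    Assumption: the vector is normalized so its entries sum to 1.
    """
    counts = Counter(n % 12 for n in note_numbers)
    total = sum(counts.values())
    if total == 0:
        # A file with no notes gets an all-zero vector.
        return [0.0] * 12
    return [counts.get(pc, 0) / total for pc in range(12)]


# Hypothetical input: a C major triad (C4, E4, G4) with C doubled an octave higher.
vector = pitch_class_distribution([60, 64, 67, 72])
print(vector)
```

One such vector per MIDI file, stacked into a matrix, is the kind of input that clustering and dimensionality-reduction libraries generally expect.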
Features
- Displays the UMAP mapping for one of three feature sets derived from the Lakh Clean MIDI Dataset.
- On zooming into one of the regions of the UMAP mapping, details associated with each data sample (MIDI file) are displayed.