A New Axis of Sparsity for Large Language Models
LLM training in simple, raw C/CUDA
An open-source Python library for automated feature engineering
A Python library for self-supervised learning on images
An AI for Music Generation
Enhances Tesseract OCR output using LLMs (local or API)
Self-learning data agent that grounds its answers in layers of content
Language modeling in a sentence representation space
Data Lake for Deep Learning. Build, manage, and query datasets
Retrieval Augmented Generation (RAG) framework
High-Fidelity and Controllable Generation of Textured 3D Assets
Open-source MCP server that gives your coding agent
Ship RAG-based LLM web apps in seconds
Beyond the Imitation Game collaborative benchmark for measuring
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A large open dataset + tools to speed up MRI scans using ML
800,000 step-level correctness labels on LLM solutions to MATH problems
Contextually-keyed word vectors
NLP, before and after spaCy
WaveRNN Vocoder + TTS
Task of transcribing piano recordings into MIDI files
Machine learning tool that allows you to train and test models
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Large-scale autoregressive pixel model for image generation by OpenAI
Deep Hough Voting for 3D Object Detection in Point Clouds