Framework to prove inference of ML models blazingly fast
Synthetic data curation for post-training and data extraction
NeurIPS2025 Spotlight] Quantized Attention
A static type analyzer for Python code
A high-performance ML model serving framework, offers dynamic batching
Official inference repo for FLUX.2 models
Machine learning image inpainting task that removes watermarks
Framework which allows you transform your Vector Database
Fully private LLM chatbot that runs entirely with a browser
PyTorch extensions for fast R&D prototyping and Kaggle farming
Create HTML profiling reports from pandas DataFrame objects
From-scratch PyTorch implementation of Google's TurboQuant
A real time inference engine for temporal logical specifications
The official repo of Qwen chat & pretrained large language model
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Towards Real-World Vision-Language Understanding
Set of comprehensive computer vision & machine intelligence libraries
LLM.swift is a simple and readable library
Genome modeling and design across all domains of life
Open-source LLM load balancer and serving platform for hosting LLMs
The best ChatGPT that $100 can buy
Research code artifacts for Code World Model (CWM)
Open-source large language model family from Tencent Hunyuan
1 min voice data can also be used to train a good TTS model
Trainable models and NN optimization tools