Industry leading face manipulation platform
Run Local LLMs on Any Device. Open-source
Universal LLM Deployment Engine with ML Compilation
Machine learning on FPGAs using HLS
Fast stable diffusion on CPU and AI PC
Personal AI, On Personal Devices
DeepMind's software stack for physics-based simulation
AirLLM 70B inference with single 4GB GPU
Official repository for LTX-Video
TTS with kokoro and onnx runtime
FlashInfer: Kernel Library for LLM Serving
Apple Silicon (MLX) port of Karpathy's autoresearch
Official inference repo for FLUX.1 models
Open-source, code-first Python toolkit for building, evaluating, etc.
The most powerful local music generation model
Any model. Any hardware. Zero compromise
Parallax is a distributed model serving framework
Official inference framework for 1-bit LLMs
Generate audiobooks from e-books, voice cloning & 1107+ languages
AI video generator optimized for low VRAM and older GPUs use
Performance-optimized AI inference on your GPUs
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Accessible large language models via k-bit quantization for PyTorch
Accelerate local LLM inference and finetuning
Open deep learning compiler stack for cpu, gpu, etc.