3D reconstruction software
GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Agentic, Reasoning, and Coding (ARC) foundation models
Focus on prompting and generating
Industry leading face manipulation platform
A Simple and Universal Swarm Intelligence Engine
Personal AI, On Personal Devices
TTS with kokoro and onnx runtime
The most powerful and modular diffusion model GUI, api and backend
Stable Diffusion web UI
Deep Research framework, combining language models with tools
Run Local LLMs on Any Device. Open-source
Public repository for Agent Skills
A simple, high-quality voice conversion tool focused on ease of use
OCRmyPDF adds an OCR text layer to scanned PDF files
AI video generator optimized for low VRAM and older GPUs use
The agent that grows with you
Open-source, high-performance AI model with advanced reasoning
Official Python inference and LoRA trainer package
Native and Compact Structured Latents for 3D Generation
gpt-oss-120b and gpt-oss-20b are two open-weight language models
The most powerful local music generation model
Robust Speech Recognition via Large-Scale Weak Supervision