Real time face swap and one-click video deepfake
The most powerful and modular diffusion model GUI, api and backend
Agentic, Reasoning, and Coding (ARC) foundation models
Official Python inference and LoRA trainer package
NVR with realtime local object detection for IP cameras
AutoML library for deep learning
LLM based autonomous agent that does online comprehensive research
Generate audiobooks from e-books
Turns Data and AI algorithms into production-ready web applications
Tokenizer-Free TTS for Multilingual Speech Generation
A Powerful Native Multimodal Model for Image Generation
Scalable RL solution for advanced reasoning of language models
A community-supported supercharged version of paperless
Pretrained time-series foundation model developed by Google Research
High-Quality Voice Cloning TTS for 600+ Languages
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Open Source Document Management System for Digital Archives
Fast backend for long-term AI user memory via structured profiles
An AI-powered security review GitHub Action using Claude
An Open Source text-to-speech system built by inverting Whisper
A simple but complete full-attention transformer
Instant voice cloning by MIT and MyShell. Audio foundation model
Framework for orchestrating role-playing, autonomous AI agents
The AI Assistant that actually does things for the trades
Collection of Gemma 3 variants that are trained for performance