A Web UI for easy subtitle using whisper model
Modular AI image and video generation web UI with extensible tools
End-to-end pipeline converting generative videos
Your Personal AI Assistant; easy to install, deploy on local or coud
Time-lapse Video Generation Models as Metamorphic Simulators
Context-aware desktop AI assistant that understands screen content
Multilingual speech recognition and audio understanding model
InvokeAI is a leading creative engine for Stable Diffusion models
Offline inference engine for art, real-time voice conversations
Speech-AI-Forge is a project developed around TTS generation model
Python tool for browser-based interactive data apps in one file
Document Image Parsing via Heterogeneous Anchor Prompting”
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Synthetic data generators for tabular and time-series data
AI coding workstation: Claude Code + web UI + 5 AI CLIs + headless
A sound cloning tool with a web interface, using your voice
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Spark-TTS Inference Code
Fast-stable-diffusion + DreamBooth
Real-time Claude Code usage monitor with predictions and warnings
The most powerful Android RPA agent framework
Determined, deep learning training platform
Python chatbot framework with Natural Language Understanding
Custom Chinese chatbot with Seq2Seq, GPT, and agent features
An open sourced end-to-end VLM-based GUI Agent