Open source multimodal creative AI assistant with infinite canvas tool
Renderer for the harmony response format to be used with gpt-oss
Agent framework and applications built upon Qwen>=3.0
GitLab automatic code review tool based on large models
A graphical manager for ollama that can manage your LLMs
GUI for a Vocal Remover that uses Deep Neural Networks
A Simple and Universal Swarm Intelligence Engine
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Deep Research framework, combining language models with tools
Industry leading face manipulation platform
Focus on prompting and generating
Stable Diffusion web UI
Official Python inference and LoRA trainer package
Run Local LLMs on Any Device. Open-source
TTS with kokoro and onnx runtime
The most powerful and modular diffusion model GUI, api and backend
The agent that grows with you
A simple, high-quality voice conversion tool focused on ease of use
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
3D reconstruction software
Powerful AI language model (MoE) optimized for efficiency/performance
Public repository for Agent Skills
OCRmyPDF adds an OCR text layer to scanned PDF files