GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Focus on prompting and generating
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
Run Local LLMs on Any Device. Open-source
Stable Diffusion web UI
A Simple and Universal Swarm Intelligence Engine
Industry leading face manipulation platform
OCRmyPDF adds an OCR text layer to scanned PDF files
The most powerful and modular diffusion model GUI, api and backend
A simple, high-quality voice conversion tool focused on ease of use
Wan2.2: Open and Advanced Large-Scale Video Generative Model
OCR software, free and offline
3D reconstruction software
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Code for running inference and finetuning with SAM 3 model
Open-source, high-performance AI model with advanced reasoning
A lightweight audio-to-MIDI converter with pitch bend detection
Advanced language and coding AI model
The most powerful local music generation model
TTS with kokoro and onnx runtime
GLM-4.5: Open-source LLM for intelligent agents by Z.ai