AI video generator optimized for low VRAM and older GPUs use
Stable Diffusion web UI
Modular AI image and video generation web UI with extensible tools
Generate audiobooks from EPUBs, PDFs and text with captions
Visual Causal Flow
A TTS that fits in your CPU (and pocket)
Qwen3-ASR is an open-source series of ASR models
Enhances Tesseract OCR output using LLMs (local or API)
OCR model for complex documents with layout-aware structured outputs
Open source platform for the machine learning lifecycle
95% token savings. 155x faster queries. 16 languages
The fastest way to build data pipelines
HivisionIDPhotos: a lightweight and efficient AI ID photos tools
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Advanced AI Explainability for computer vision
Stable Diffusion web UI
LLM-based agent for general purpose software engineering tasks
A Repo For Document AI
General-purpose image editing model that delivers high-fidelity
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Multi-Voice and Prompt-Controlled TTS Engine
Code for the paper Hybrid Spectrogram and Waveform Source Separation
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator