Official inference repo for FLUX.1 models
Robust Speech Recognition via Large-Scale Weak Supervision
Stable Diffusion web UI
A high-throughput and memory-efficient inference and serving engine
Personal AI, On Personal Devices
Public repository for Agent Skills
State-of-the-art TTS model under 25MB
As little as 1 minute of voice data can be used to train a good TTS model
OCRmyPDF adds an OCR text layer to scanned PDF files
Reverse-engineered Python API for Google Gemini web app
Awesome multilingual OCR toolkits based on PaddlePaddle
Deepfakes Software For All
Image polygonal annotation with Python
Code for running inference and finetuning with SAM 3 model
The most powerful and modular diffusion model GUI, API, and backend
MCP Server for IDA Pro
The official Python client for the Hugging Face Hub
A simple, high-quality voice conversion tool focused on ease of use
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
An Async Bot/API wrapper for Twitch made in Python
3D reconstruction software
Ready-to-use OCR with 80+ supported languages
OBLITERATE THE CHAINS THAT BIND YOU
Generate short videos with one click using an AI LLM
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning