YOLOv5 is the world's most loved vision AI
The most powerful and modular diffusion model GUI, API and backend
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Contexts Optical Compression
Robust Speech Recognition via Large-Scale Weak Supervision
OCRmyPDF adds an OCR text layer to scanned PDF files
The official gpt4free repository
The official Python SDK for Model Context Protocol servers and clients
Open-source, high-performance AI model with advanced reasoning
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Powerful AI language model (MoE) optimized for efficiency/performance
Stable Diffusion web UI
Qwen3 is the large language model series developed by Qwen team
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Framework for Telegram Bot API written in Python 3.7 with asyncio
InvokeAI is a leading creative engine for Stable Diffusion models
NVR with realtime local object detection for IP cameras
Image inpainting tool powered by SOTA AI Model
Lightweight Face Recognition and Facial Attribute Analysis
Open-Sora: Democratizing Efficient Video Production for All
Awesome multilingual OCR toolkits based on PaddlePaddle
A deep learning toolkit for Text-to-Speech, battle-tested in research
Open Source Document Management System for Digital Archives
Label Studio is a multi-type data labeling and annotation tool
An experimental version of DeepSeek model