Powerful AI language model (MoE) optimized for efficiency/performance
Image inpainting tool powered by SOTA AI Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model
OCRmyPDF adds an OCR text layer to scanned PDF files
3D reconstruction software
TTS with kokoro and onnx runtime
Open-source, high-performance AI model with advanced reasoning
Awesome multilingual OCR toolkits based on PaddlePaddle
A simple, high-quality voice conversion tool focused on ease of use
OCR software, free and offline
Official Python inference and LoRA trainer package
The most powerful local music generation model
A theoretical reconstruction of the Claude Mythos architecture
AI tool that removes hardcoded subtitles and text from videos locally
Native and Compact Structured Latents for 3D Generation
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.2 models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Python tool for converting files and office documents to Markdown
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Qwen3-TTS is an open-source series of TTS models
OBLITERATE THE CHAINS THAT BIND YOU
A Lightweight Face Recognition and Facial Attribute Analysis