TTS with kokoro and onnx runtime
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Containerized automation engine for programmable CI/CD workflows
Image inpainting tool powered by SOTA AI Model
Official Python inference and LoRA trainer package
Wan2.1: Open and Advanced Large-Scale Video Generative Model
1 min voice data can also be used to train a good TTS model
A simple, high-quality voice conversion tool focused on ease of use
OCRmyPDF adds an OCR text layer to scanned PDF files
The most powerful local music generation model
Robust Speech Recognition via Large-Scale Weak Supervision
Python inference and LoRA trainer package for the LTX-2 audio–video
3D reconstruction software
OCR software, free and offline
Awesome multilingual OCR toolkits based on PaddlePaddle
Agent-ready RPA suite with visual workflow automation tools engine
World's first open-source, agentic video production system
Open-source, high-performance AI model with advanced reasoning
AI tool that removes hardcoded subtitles and text from videos locally
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Python tool for converting files and office documents to Markdown
A modular, primitive-first, python-first PyTorch library
Enterprise platform for building and orchestrating AI agent workflows