GUI for a Vocal Remover that uses Deep Neural Networks
Generate short videos with one click using AI LLM
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official Python inference and LoRA trainer package
Wan2.2: Open and Advanced Large-Scale Video Generative Model
OCR software, free and offline
A high-quality rapid TTS voice cloning model
High-Quality Voice Cloning TTS for 600+ Languages
Official repository for LTX-Video
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source, high-performance AI model with advanced reasoning
Open-Sora: Democratizing Efficient Video Production for All
A high-performance image compression microservice based on MCP
Industry leading face manipulation platform
Focus on prompting and generating
A high-throughput and memory-efficient inference and serving engine
A simple, high-quality voice conversion tool focused on ease of use
High-level training, data augmentation, and utilities for Pytorch
A high-quality tool for convert PDF to Markdown and JSON
Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents
Recovering the Visual Space from Any Views
State-of-the-art TTS model under 25MB
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Port of OpenAI's Whisper model in C/C++
MOSS-TTS-Nano is an open-source multilingual tiny speech generation