A simple, high-quality voice conversion tool focused on ease of use
Web interface for generating images using Stable Diffusion models
NVR with realtime local object detection for IP cameras
Synchronized Translation for Videos
A high-throughput and memory-efficient inference and serving engine
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
RGBD video generation model conditioned on camera input
From Images to High-Fidelity 3D Assets
A Lightweight Face Recognition and Facial Attribute Analysis
Use Microsoft Edge's online text-to-speech service from Python
A Gradio web UI for running Large Language Models like LLaMA
Generate audiobooks from e-books, voice cloning & 1107+ languages
Comprehensive Gradio WebUI for audio processing
A Python wrapper you can't refuse
Open-Sora: Democratizing Efficient Video Production for All
Real-World Centric Foundation GUI Agents
Python inference and LoRA trainer package for the LTX-2 audio–video
A command-line productivity tool powered by AI large language models
Generate short videos with one click using AI LLM
Qwen3-Coder is the code version of Qwen3
Models for object and human mesh reconstruction
Machine learning in Python
Text- and image-to-video generation: CogVideoX (2024) and CogVideo
TensorFlow is an open source library for machine learning
ChemCrow