Web interface for generating images using Stable Diffusion models
NVR with realtime local object detection for IP cameras
Synchronized Translation for Videos
A high-throughput and memory-efficient inference and serving engine
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
RGBD video generation model conditioned on camera input
From Images to High-Fidelity 3D Assets
A Lightweight Face Recognition and Facial Attribute Analysis
Use Microsoft Edge's online text-to-speech service from Python
A gradio web UI for running Large Language Models like LLaMA
Comprehensive Gradio WebUI for audio processing
Generate audiobooks from e-books, voice cloning & 1107+ languages
A Python wrapper you can't refuse
Open-Sora: Democratizing Efficient Video Production for All
Real-World Centric Foundation GUI Agents
A command-line productivity tool powered by AI large language models
Python inference and LoRA trainer package for the LTX-2 audio–video
Generate short videos with one click using AI LLM
Machine learning in Python
Qwen3-Coder is the code version of Qwen3
Models for object and human mesh reconstruction
text and image to video generation: CogVideoX (2024) and CogVideo
Chemcrow
TensorFlow is an open source library for machine learning
InvokeAI is a leading creative engine for Stable Diffusion models