A Lightweight Face Recognition and Facial Attribute Analysis
1 min voice data can also be used to train a good TTS model
A high-throughput and memory-efficient inference and serving engine
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Web interface for generating images using Stable Diffusion models
Synchronized Translation for Videos
RGBD video generation model conditioned on camera input
Generate audiobooks from e-books, voice cloning & 1107+ languages
Use Microsoft Edge's online text-to-speech service from Python
Open-Sora: Democratizing Efficient Video Production for All
Comprehensive Gradio WebUI for audio processing
A gradio web UI for running Large Language Models like LLaMA
A Python wrapper you can't refuse
Machine learning in Python
Real-World Centric Foundation GUI Agents
Python inference and LoRA trainer package for the LTX-2 audio–video
Models for object and human mesh reconstruction
text and image to video generation: CogVideoX (2024) and CogVideo
From Images to High-Fidelity 3D Assets
Reference PyTorch implementation and models for DINOv3
Generate short videos with one click using AI LLM
TensorFlow is an open source library for machine learning
A sound cloning tool with a web interface, using your voice
No fortress, purely open ground. OpenManus is Coming
Chemcrow