Instant voice cloning by MIT and MyShell. Audio foundation model
A Unified Framework for Image Customization
Python HTTP client with TLS and HTTP/2 fingerprint emulation support
Qwen-Image is a powerful image generation foundation model
An open source chat-ops bot framework
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Pretrained model hub for Keras 3
ComfyUI nodes for LivePortrait
Photorealistic Synthetic Dataset for Holistic Indoor Scene
A lightweight, powerful framework for multi-agent workflows
The largest collection of PyTorch image encoders / backbones
Python client for FCM - Firebase Cloud Messaging
Open source AI model for generating full songs from lyrics prompts
Open source multimodal creative AI assistant with infinite canvas tool
Supercharge Your LLM with the Fastest KV Cache Layer
Pytest in IPython notebooks
VMZ: Model Zoo for Video Modeling
Skywork-R1V is an advanced multimodal AI model series
Omnilingual ASR Open-Source Multilingual SpeechRecognition
An industrial grade federated learning framework
Super Tiny Icons are miniscule SVG versions of your favourite website
Pytorch domain library for recommendation systems
The data structure for multimodal data
Chat & pretrained large audio language model proposed by Alibaba Cloud
Towards Human-Level Text-to-Speech through Style Diffusion