Instant voice cloning by MIT and MyShell. Audio foundation model
A Family of Open-Source Music Foundation Models
Example client of oagi-python developed with Tauri
High-performance neural network inference framework for mobile
SOTA Open Source TTS
Taming Stable Diffusion for Lip Sync
Interface for OuteTTS models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multi-lingual large voice generation model, providing inference
Run PyTorch LLMs locally on servers, desktop and mobile
A lightweight text-to-speech model with zero-shot voice cloning
Open speech-to-speech models and pipelines by Hugging Face
AI Code Security Anti-Patterns distilled from 150+ sources
Expert System Tool
Official code for Style Aligned Image Generation via Shared Attention
Latent Diffusion and Stable Diffusion Implementation
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multilingual voice cloning model with 6-second voice samples