Instant voice cloning by MIT and MyShell. Audio foundation model
A Family of Open Sourced Music Foundation Models
Example client of oagi-python developed with Tauri
High-performance neural network inference framework for mobile
SOTA Open Source TTS
Interface for OuteTTS models
Taming Stable Diffusion for Lip Sync
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multi-lingual large voice generation model, providing inference
Run PyTorch LLMs locally on servers, desktop and mobile
A lightweight text-to-speech model with zero-shot voice cloning
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
AI Code Security Anti-Patterns distilled from 150+ sources
Jupyter notebooks that walk you through the fundamentals of ML
Expert System Tool
Official code for Style Aligned Image Generation via Shared Attention
Latent Diffusion and Stable Diffusion Implementation
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multilingual voice cloning model with 6-second voice samples