Instant voice cloning by MIT and MyShell. Audio foundation model
A Family of Open Sourced Music Foundation Models
Official inference repo for FLUX.2 models
Example client of oagi-python developed with Tauri
Interface for OuteTTS models
High-performance neural network inference framework for mobile
Reference implementations of MLPerf™ training benchmarks
SOTA Open Source TTS
Maid is a cross-platform Flutter app for interfacing with GGUF
A lightweight text-to-speech model with zero-shot voice cloning
Taming Stable Diffusion for Lip Sync
GUI for a Vocal Remover that uses Deep Neural Networks
High-Resolution Image Synthesis with Latent Diffusion Models
ONNX Runtime: cross-platform, high performance ML inferencing
Multi-lingual large voice generation model, providing inference
Ultra-Efficient AI Assistant in Go
1 min voice data can also be used to train a good TTS model
Free and Open Source AI Image Upscaler for Linux, MacOS and Windows
A high-quality rapid TTS voice cloning model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
MARS5 speech model (TTS) from CAMB.AI
Cross platform .Net wrapper to the OpenCV image processing library
Towards Human-Level Text-to-Speech through Style Diffusion
A sound cloning tool with a web interface, using your voice
Flutter-based cross-platform app integrating major AI models