Instant voice cloning by MIT and MyShell. Audio foundation model
A Family of Open Sourced Music Foundation Models
Example client of oagi-python developed with Tauri
Official inference repo for FLUX.2 models
Interface for OuteTTS models
High-performance neural network inference framework for mobile
A lightweight text-to-speech model with zero-shot voice cloning
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Reference implementations of MLPerf™ training benchmarks
Taming Stable Diffusion for Lip Sync
GUI for a Vocal Remover that uses Deep Neural Networks
SOTA Open Source TTS
ONNX Runtime: cross-platform, high performance ML inferencing
Free and Open Source AI Image Upscaler for Linux, MacOS and Windows
Multi-lingual large voice generation model, providing inference
1 min voice data can also be used to train a good TTS model
Ultra-Efficient AI Assistant in Go
Cross platform .Net wrapper to the OpenCV image processing library
MARS5 speech model (TTS) from CAMB.AI
Open Source version of Claude Cowork built with Claude Code
A high-quality rapid TTS voice cloning model
Towards Human-Level Text-to-Speech through Style Diffusion
AI Code Security Anti-Patterns distilled from 150+ sources
A sound cloning tool with a web interface, using your voice
Python inference and LoRA trainer package for the LTX-2 audio–video