Instant voice cloning by MIT and MyShell. Audio foundation model
A Family of Open-Source Music Foundation Models
Official inference repo for FLUX.2 models
Example client of oagi-python developed with Tauri
Reference implementations of MLPerf™ training benchmarks
SOTA Open Source TTS
High-performance neural network inference framework for mobile
Interface for OuteTTS models
A lightweight text-to-speech model with zero-shot voice cloning
GUI for a Vocal Remover that uses Deep Neural Networks
Taming Stable Diffusion for Lip Sync
As little as 1 minute of voice data can be used to train a good TTS model
Ultra-Efficient AI Assistant in Go
Multi-lingual large voice generation model, providing inference
ONNX Runtime: cross-platform, high-performance ML inferencing
A high-quality rapid TTS voice cloning model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Free and Open Source AI Image Upscaler for Linux, macOS and Windows
MARS5 speech model (TTS) from CAMB.AI
Towards Human-Level Text-to-Speech through Style Diffusion
A sound cloning tool with a web interface, using your voice
5ire is a cross-platform desktop AI assistant, MCP client
Python inference and LoRA trainer package for the LTX-2 audio–video
Cross-platform .NET wrapper for the OpenCV image processing library
gpt-oss-120b and gpt-oss-20b are two open-weight language models