SOTA Open Source TTS
Multimodal-Driven Architecture for Customized Video Generation
The music player of today
Towards Human-Sounding Speech
MARS5 speech model (TTS) from CAMB.AI
Use Microsoft Edge's online text-to-speech service from Python
The most powerful and modular diffusion model GUI, API and backend
A high-quality rapid TTS voice cloning model
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A sound cloning tool with a web interface, using your voice
Sample code and notebooks for Generative AI on Google Cloud
High-quality multi-lingual text-to-speech library by MyShell.ai
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Automatically translates the text of a video based on a subtitle file
Oobabooga - The definitive Web UI for local AI, with powerful features
A lightweight text-to-speech model with zero-shot voice cloning
Controllable & emotion-expressive zero-shot TTS
Industrial-level controllable zero-shot text-to-speech system
The official Python library for the OpenAI API
Video editing with Python
A Python tool that uses GPT-4, FFmpeg, and OpenCV
Generate audiobooks from e-books
Offline text-to-speech synthesis for Python
Python library and CLI tool to interface with Google Translate