Use Microsoft Edge's online text-to-speech service from Python
Multimodal-Driven Architecture for Customized Video Generation
AI app store powered by 24/7 desktop history. open source
The most powerful and modular diffusion model GUI, api and backend
A sound cloning tool with a web interface, using your voice
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
Open source text-to-speech tool, supports extra-long text
Sample code and notebooks for Generative AI on Google Cloud
Converts text to speech in realtime
Unofficial Python API and agentic skill for Google NotebookLM
Label Studio is a multi-type data labeling and annotation tool
Workflow and speech recognition app
AI tool that turns Hacker News posts into daily podcast updates
Open source AI model for generating full songs from lyrics prompts
An Open Source implementation of Notebook LM with more flexibility
Automatically translates the text of a video based on a subtitle file
Speech recognition for your site
One-click deployment (including offline integration package)
The python library for real-time communication
A TTS model capable of generating ultra-realistic dialogue
Generate audiobooks from e-books
Web presentation editor replicating many PowerPoint features online
The official Node.js / Typescript library for the Groq API
High-Quality Voice Cloning TTS for 600+ Languages