Oobabooga - The definitive Web UI for local AI, with powerful features
An Open Source implementation of Notebook LM with more flexibility
Pushing the Frontier of Long Audio-Visual Generation
Generate audiobooks from e-books
MOSS‑TTS Family open‑source speech and sound generation model
The official Python SDK for the ElevenLabs API
Sample code and notebooks for Generative AI on Google Cloud
Edit videos with Claude Code
Unified web UI for training and running open models locally
Implementation of AudioLM audio generation model in Pytorch
High-Quality Voice Cloning TTS for 600+ Languages
Unofficial Python API and agentic skill for Google NotebookLM
Interface for OuteTTS models
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multimodal-Driven Architecture for Customized Video Generation
A Web UI for easy subtitle using whisper model
Offline Text To Speech synthesis for python
Label Studio is a multi-type data labeling and annotation tool
Use Microsoft Edge's online text-to-speech service from Python
Data manipulation and transformation for audio signal processing
Voice Recognition to Text Tool
Automatically translates the text of a video based on a subtitle file
EPUB to audiobook converter, optimized for Audiobookshelf
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
AI tool converting video/audio into structured documents instantly