High-Quality Voice Cloning TTS for 600+ Languages
Download videos from almost any website
An Open Source implementation of Notebook LM with more flexibility
Edit videos with Claude Code
Oobabooga - The definitive Web UI for local AI, with powerful features
Pushing the Frontier of Long Audio-Visual Generation
An open-source, ultra-low-latency remote desktop for Linux hosts
MOSS‑TTS Family open‑source speech and sound generation model
Framework for building real-time voice and multimodal AI agents
Free, high-quality text-to-speech API endpoint to replace OpenAI
The most powerful and modular diffusion model GUI, api and backend
Implementation of AudioLM audio generation model in Pytorch
A Web UI for easy subtitle using whisper model
Unofficial Python API and agentic skill for Google NotebookLM
MARS5 speech model (TTS) from CAMB.AI
Multimodal-Driven Architecture for Customized Video Generation
Unified web UI for training and running open models locally
Offline Text To Speech synthesis for python
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A youtube-dl fork with additional features and fixes
Use Microsoft Edge's online text-to-speech service from Python
The official Python SDK for the ElevenLabs API
Sample code and notebooks for Generative AI on Google Cloud
Data manipulation and transformation for audio signal processing
EPUB to audiobook converter, optimized for Audiobookshelf