Data manipulation and transformation for audio signal processing
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open speech-to-speech models and pipelines by Hugging Face
An Open Source implementation of Notebook LM with more flexibility
Generate blog articles from video or audio
Offline text-to-speech synthesis for Python
Oobabooga - The definitive Web UI for local AI, with powerful features
MARS5 speech model (TTS) from CAMB.AI
Free, high-quality text-to-speech API endpoint to replace OpenAI
Use Microsoft Edge's online text-to-speech service from Python
Open source AI model for generating full songs from lyrics prompts
Unified web UI for training and running open models locally
Generate audiobooks from e-books
A Systematic Framework for Interactive World Modeling
Helps scientists define testable, modular, self-documenting dataflow
DoWhy is a Python library for causal inference
Sample code and notebooks for Generative AI on Google Cloud
A sound cloning tool with a web interface, using your voice
An opinionated CLI to transcribe audio files with Whisper on-device
Framework for building real-time voice and multimodal AI agents
ImageBind: One Embedding Space to Bind Them All
Multi-user UI for managing and running Stable Diffusion workflows
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
A Python tool that uses GPT-4, FFmpeg, and OpenCV