Streaming Real-time Audio-Driven Avatar Generation
Open source text-to-speech tool, supports extra-long text
Interface for OuteTTS models
Give Claude the ability to watch and understand videos
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
WhatsApp library for NodeJS that connects through the browser app
2023, the latest audio and video learning materials, projects
A single Gradio + React WebUI with extensions for ACE-Step
Use Microsoft Edge's online text-to-speech service from Python
Self-hosted AI audio transcription
Convert files and web content into clean, usable Markdown easily
Open Source Speech Language Model
Generate blog articles from video or audio
A Systematic Framework for Interactive World Modeling
Towards Human-Sounding Speech
PersonaPlex code
Multimodal-Driven Architecture for Customized Video Generation
Descent 3 by Outrage Entertainment
Self-hosted collection of powerful web-based tools for everyday tasks
MARS5 speech model (TTS) from CAMB.AI
AI tool converting video/audio into structured documents instantly
Simple DirectMedia Layer
Workflow and speech recognition app
Cross-platform Music Visualization Library