A text-to-speech, speech-to-text and speech-to-speech library
Use Microsoft Edge's online text-to-speech service from Python
Python library and CLI tool to interface with Google Translate
Offline Text To Speech synthesis for python
Translate the video from one language to another and embed dubbing
A fast TTS architecture with conditional flow matching
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
An opinionated CLI to transcribe Audio files w/ Whisper on-device
EPUB to audiobook converter, optimized for Audiobookshelf
Automatically translates the text of a video based on a subtitle file
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Generate audiobooks from e-books
LLM-based Reinforcement Learning audio edit model
A Web UI for easy subtitle using whisper model
Aligns tokens in two versions of a text with differing tokenization.
Unlimited, private and free Speech-To-Text program
Run GGUF models easily with a UI or API. One File. Zero Install.
Precision Trigonometry: Advanced Calculator for Complex Math
A webui for different audio related Neural Networks
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
WaveRNN Vocoder + TTS
Library of deep learning models and datasets
Vinux is an Ubuntu derived distribution for blind & visually impaired.