A nearly-live implementation of OpenAI's Whisper
Build Vision Agents quickly with any model or video provider
MARS5 speech model (TTS) from CAMB.AI
Virtual AI anchor that combines state-of-the-art technology
EPUB to audiobook converter, optimized for Audiobookshelf
A text-to-speech, speech-to-text and speech-to-speech library
End-to-end speech processing toolkit
Clone a voice in 5 seconds to generate arbitrary speech in real time
Pre-trained and Reproduced Deep Learning Models
The open-source virtual assistant for Ubuntu-based Linux distributions
TensorFlow Implementation of DC-TTS: yet another text-to-speech model