Scalable generative AI framework built for researchers and developers
Offline text-to-speech synthesis for Python
StreamSpeech is a seamless model for offline speech recognition
Offline inference engine for art, real-time voice conversations
A TTS that fits in your CPU (and pocket)
One-click deployment (including offline integration package)
Framework for building neural networks
Build Vision Agents quickly with any model or video provider
A sound cloning tool with a web interface, using your voice
Virtual AI anchor that combines state-of-the-art technology
A lightweight text-to-speech model with zero-shot voice cloning
A text-to-speech, speech-to-text and speech-to-speech library
Real-time voice interactive digital human
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
An open-source text-to-speech system built by inverting Whisper
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Text to Speech Utility
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chinese voice dialogue robot/smart speaker project
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
General Speech Restoration
Read written or imported text aloud offline, or download it online.
Toolkit for efficient experimentation with Speech Recognition