TTS with Kokoro and ONNX Runtime
Speech-AI-Forge is a project developed around TTS generation models
Offline inference engine for art, real-time voice conversations
Multi-Voice and Prompt-Controlled TTS Engine
Towards Human-Sounding Speech
Interface for OuteTTS models
One-click deployment (including offline integration package)
Provides CTP stock options and Zhongtai Securities XTP
A nearly-live implementation of OpenAI's Whisper
StreamSpeech is a seamless model for offline speech recognition
Synchronized Translation for Videos
An open-source text-to-speech system built by inverting Whisper
Virtual AI anchor that combines state-of-the-art technology
Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
A single Gradio + React WebUI with extensions for ACE-Step
Unofficial Parallel WaveGAN
Chinese text-to-speech engine
A WebUI for different audio-related neural networks
WaveRNN Vocoder + TTS
Toolkit for efficient experimentation with Speech Recognition