StreamSpeech is a seamless model for offline speech recognition
Foundational model for human-like, expressive TTS
Automatically translates the text of a video based on a subtitle file
Real-time voice interactive digital human
Long-form streaming TTS system for multi-speaker dialogue generation
Bailing is a voice dialogue robot similar to GPT-4o
Scalable generative AI framework built for researchers and developers
Multi-lingual large voice generation model, providing inference
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
A single Gradio + React WebUI with extensions for ACE-Step
One-click deployment (including offline integration package)
Provides CTP stock options and Zhongtai Securities XTP
MARS5 speech model (TTS) from CAMB.AI
Framework for building neural networks
SOTA discrete acoustic codec models with 40/75 tokens per second
Chuyển đổi văn bản thành giọng nói không giới hạn
Virtual AI anchor that combines state-of-the-art technology
High-quality multi-lingual text-to-speech library by MyShell.ai
Towards Human-Level Text-to-Speech through Style Diffusion
Mice speech to text with MX Cinnamon OS ISO
A Conversational Speech Generation Model
Text to Speech Utility
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS