Towards Human-Sounding Speech
Build Vision Agents quickly with any model or video provider
The official Python SDK for the ElevenLabs API
Long-form streaming TTS system for multi-speaker dialogue generation
State-of-the-art open-source TTS
StreamSpeech is a seamless model for offline speech recognition
Tokenizer-Free TTS for Multilingual Speech Generation
Bailing is a voice dialogue robot similar to GPT-4o
Converts text to speech in real time
A nearly-live implementation of OpenAI's Whisper
A fast TTS architecture with conditional flow matching
A TTS model capable of generating ultra-realistic dialogue
Real-time voice interactive digital human
Free, high-quality text-to-speech API endpoint to replace OpenAI