Qwen3-TTS is an open-source series of TTS models
Automatically translates the text of a video based on a subtitle file
Industrial-level controllable zero-shot text-to-speech system
Controllable & emotion-expressive zero-shot TTS
State-of-the-art TTS model under 25MB
Build Vision Agents quickly with any model or video provider
A Conversational Speech Generation Model