Azure AI Speech

Azure AI Speech

Microsoft
Qwen3-TTS

Qwen3-TTS

Alibaba
+
+

Related Products

  • Google Cloud Speech-to-Text
    373 Ratings
    Visit Website
  • QEval
    30 Ratings
    Visit Website
  • Google AI Studio
    11 Ratings
    Visit Website
  • Fathom
    7,148 Ratings
    Visit Website
  • LALAL.AI
    4,565 Ratings
    Visit Website
  • Community Phone
    1,135 Ratings
    Visit Website
  • RingCentral RingEX
    3,189 Ratings
    Visit Website
  • iPlum
    9,124 Ratings
    Visit Website
  • LM-Kit.NET
    23 Ratings
    Visit Website
  • Twilio
    1,343 Ratings
    Visit Website

About

Build voice-enabled apps confidently and quickly with the Speech SDK. Transcribe speech to text with high accuracy, produce natural-sounding text-to-speech voices, translate spoken audio, and use speaker recognition during conversations. Create custom models tailored to your app with Speech studio. Get state-of-the-art speech to text, lifelike text to speech, and award-winning speaker recognition. Your data stays yours, your speech input is not logged during processing. Create custom voices, add specific words to your base vocabulary, or build your own models. Run Speech anywhere, in the cloud or at the edge in containers. Quickly and accurately transcribe audio in more than 92 languages and variants. Gain customer insights with call center transcription, improve experiences with voice-enabled assistants, capture key discussions in meetings and more. Use text to speech to create apps and services that speak conversationally, choosing from more than 215 voices, and 60 languages.

About

Qwen3-TTS is an open source series of advanced text-to-speech models developed by the Qwen team at Alibaba Cloud under the Apache-2.0 license, offering stable, expressive, and real-time speech generation with features such as voice cloning, voice design, and fine-grained control of prosody and acoustic attributes. The models support 10 major languages, including Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian, and multiple dialectal voice profiles with adaptive control over tone, speaking rate, and emotional expression based on text semantics and instructions. Qwen3-TTS uses efficient tokenization and a dual-track architecture that enables ultra-low-latency streaming synthesis (first audio packet in ~97 ms), making it suitable for interactive and real-time use cases, and includes a range of models with different capabilities (e.g., rapid 3-second voice cloning, custom voice timbres, and instruction-based voice design).

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

Developers in search of a solution to transcribe speech to text and build voice-enabled apps

Audience

Researchers who need a model for expressive, multilingual, controllable, and streaming voice generation in applications like voice assistants, dubbing, accessibility, and creative audio synthesis

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

Free
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Microsoft
Founded: 1975
United States
azure.microsoft.com/en-us/products/ai-services/ai-speech

Company Information

Alibaba
Founded: 1999
China
github.com/QwenLM/Qwen3-TTS

Alternatives

Alternatives

Inworld TTS

Inworld TTS

Inworld
Fish Audio

Fish Audio

Hanabi AI
Chirp 3

Chirp 3

Google

Categories

Categories

Integrations

Alibaba Cloud
Azure Marketplace
Blabby
Crestwood Cloud
Custom Neural Voice
Microsoft 365
Microsoft Azure
Qwen
Restack
Whisper

Integrations

Alibaba Cloud
Azure Marketplace
Blabby
Crestwood Cloud
Custom Neural Voice
Microsoft 365
Microsoft Azure
Qwen
Restack
Whisper
Claim Azure AI Speech and update features and information
Claim Azure AI Speech and update features and information
Claim Qwen3-TTS and update features and information
Claim Qwen3-TTS and update features and information