Realtime TTS-2Inworld
|
||||||
Related Products
|
||||||
About
Realtime TTS-2 from Inworld AI is a new generation of voice model built for real-time conversation: a voice model that feels as human as it sounds. It hears the full audio of an exchange, picks up the user’s tone, pacing, and emotional state, then takes voice direction in plain English, the way developers prompt an LLM. Instead of generating speech in isolation, it listens to prior turns of the exchange, so tone and pacing carry forward, and the same line can land differently after a joke than after bad news. Voice Direction lets developers steer delivery like a director would steer a voice actor, using natural-language descriptions rather than fixed emotion presets or sliders. Inline nonverbals like [sigh], [breathe], and [laugh] can be placed inside the text, and the model renders them as audio events. Realtime TTS-2 preserves one voice identity across more than 100 languages, including mid-utterance language switches.
|
About
Realistic text generator. The following features are available:
- Voicing of huge texts. Up to 2 000 000 characters per generation. You can voice a large book at a time and get 1 file.
- 270+ voices in 33 languages
- Easy to edit. You can mark up text and generate audio with segments.
- You can add several different voices to one audio.
- It is convenient to select a voice. Listen to a demo of each voice and choose your favorite.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Voice AI developers building realtime agents, characters, tutors, support systems, and companions that need emotionally aware, multilingual, humanlike speech
|
Audience
Video makers, Newsmakers, Students, Foreigners, Marketers, Software developers, Educators, Webmasters
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
$25 per month
Free Version
Free Trial
|
Pricing
$4.99
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationInworld
Founded: 2021
United States
inworld.ai/blog/realtime-tts-2
|
Company InformationSpeechGen
Founded: 2020
Kazakhstan
speechgen.io
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
||||||
|
|
||||||
Categories |
Categories |
|||||
Integrations
ChatGPT
Claude
Gemini
Grok
Perplexity
|
||||||
|
|
|