+
+

Related Products

  • Google Cloud Speech-to-Text
    361 Ratings
    Visit Website
  • Gemini Enterprise Agent Platform
    962 Ratings
    Visit Website
  • DialerAI
    5 Ratings
    Visit Website
  • LALAL.AI
    5,019 Ratings
    Visit Website
  • Squaretalk
    276 Ratings
    Visit Website
  • Assembled
    254 Ratings
    Visit Website
  • Dialpad Support
    1,583 Ratings
    Visit Website
  • Forethought
    167 Ratings
    Visit Website
  • The Asset Guardian EAM (TAG)
    22 Ratings
    Visit Website
  • UptimeRobot
    809 Ratings
    Visit Website

About

Miso Labs builds emotive foundation models for voice, designed to help developers create voice agents that feel fast, warm, and human instead of robotic or delayed. Its flagship model, Miso TTS, is an 8-billion-parameter transformer model for state-of-the-art emotive speech and dialogue generation, with open source weights available on Hugging Face and API access coming soon. Miso is built for real-time conversational voice, responding in 110ms to preserve natural flow and avoid the awkward pauses common in AI voice agents. It supports one-shot voice cloning, allowing users to clone a voice from a ten-second audio clip while keeping the agent’s voice consistent from the first second of a call to the last. Miso Labs also emphasizes local and sovereign deployment, with open source models built for local use and on-premises hosting and support available for enterprise teams that need to keep sensitive data in-house.

About

Realtime TTS-2 from Inworld AI is a new generation of voice model built for real-time conversation: a voice model that feels as human as it sounds. It hears the full audio of an exchange, picks up the user’s tone, pacing, and emotional state, then takes voice direction in plain English, the way developers prompt an LLM. Instead of generating speech in isolation, it listens to prior turns of the exchange, so tone and pacing carry forward, and the same line can land differently after a joke than after bad news. Voice Direction lets developers steer delivery like a director would steer a voice actor, using natural-language descriptions rather than fixed emotion presets or sliders. Inline nonverbals like [sigh], [breathe], and [laugh] can be placed inside the text, and the model renders them as audio events. Realtime TTS-2 preserves one voice identity across more than 100 languages, including mid-utterance language switches.

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Platforms Supported

Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook

Audience

AI developers and enterprise agent teams that need low-latency, emotionally expressive text-to-speech, one-shot voice cloning, and local deployment options

Audience

Voice AI developers building realtime agents, characters, tutors, support systems, and companions that need emotionally aware, multilingual, humanlike speech

Support

Phone Support
24/7 Live Support
Online

Support

Phone Support
24/7 Live Support
Online

API

Offers API

API

Offers API

Screenshots and Videos

Screenshots and Videos

Pricing

No information available.
Free Version
Free Trial

Pricing

$25 per month
Free Version
Free Trial

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Reviews/Ratings

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Training

Documentation
Webinars
Live Online
In Person

Training

Documentation
Webinars
Live Online
In Person

Company Information

Miso TTS
Founded: 2025
United States
www.misolabs.ai/

Company Information

Inworld
Founded: 2021
United States
inworld.ai/blog/realtime-tts-2

Alternatives

Alternatives

Inworld TTS

Inworld TTS

Inworld
Listnr

Listnr

Listnr AI
LOVO

LOVO

Love Your Voice

Categories

Categories

Integrations

ChatGPT
Claude
Gemini
Grok
Hugging Face
Perplexity

Integrations

ChatGPT
Claude
Gemini
Grok
Hugging Face
Perplexity
Claim Miso TTS and update features and information
Claim Miso TTS and update features and information
Claim Realtime TTS-2 and update features and information
Claim Realtime TTS-2 and update features and information