EVI 3 (Hume AI)
|
MAI-Voice-1 (Microsoft)
|
|||||
About
Hume AI's EVI 3 is a third-generation speech-language model that takes streaming user speech as input and produces natural, expressive speech and language responses. At conversational latency, it produces the same quality of speech as Hume AI's text-to-speech model, Octave, while responding with the same intelligence as the most advanced LLMs of similar latency. It also communicates with reasoning models and web search systems as it speaks, “thinking fast and slow” to match the intelligence of any frontier AI system. EVI 3 can instantly generate new voices and personalities instead of being limited to a handful of speakers. For instance, users can speak to any of the more than 100,000 custom voices already created on Hume AI's text-to-speech platform, each with an inferred personality. No matter the voice, it responds with a wide range of emotions or styles, implicitly or on command.
|
About
MAI-Voice-1 is Microsoft AI’s first highly expressive and natural speech generation model. It is designed to produce high-fidelity, emotionally rich audio in both single- and multi-speaker scenarios, and it can generate a full minute of audio in under one second on a single GPU. The model is integrated into Copilot Daily and Podcasts, and it powers a new Copilot Labs experience where users can test its expressive speech and storytelling capabilities, such as crafting “choose your own adventure” narratives or bespoke guided meditations from simple prompts. Microsoft envisions voice as the interface of the future for AI companions and positions MAI-Voice-1, with its speed and realism, as one of the most efficient speech systems available. The company is exploring the potential of voice interfaces to create immersive, personalized AI interactions.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Developers and businesses looking to integrate emotionally intelligent, real-time voice AI into their applications
|
Audience
Users and developers seeking conversational audio generation that enriches AI interactions with expressive, natural-sounding voice
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Pricing
Free
Free Version
Free Trial
|
Pricing
No information available.
Free Version
Free Trial
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company Information
Hume AI
Founded: 2021
United States
www.hume.ai/blog/introducing-evi-3
|
Company Information
Microsoft
Founded: 1975
United States
microsoft.ai/news/two-new-in-house-models/
|
|||||