Gemini Live APIGoogle
|
||||||
Related Products
|
||||||
About
The Gemini Live API is a preview feature that enables low-latency, bidirectional voice and video interactions with Gemini. It allows end users to experience natural, human-like voice conversations and provides the ability to interrupt the model's responses using voice commands. The model can process text, audio, and video input, and it can provide text and audio output. New capabilities include two new voices and 30 new languages with configurable output language, configurable image resolutions (66/256 tokens), configurable turn coverage (send all inputs all the time or only when the user is speaking), configurable interruption settings, configurable voice activity detection, new client events for end-of-turn signaling, token counts, a client event for signaling the end of stream, text streaming, configurable session resumption with session data stored on the server for 24 hours, and longer session support with a sliding context window.
|
About
Layercode is a cloud-based developer platform that makes it easy to build production-ready, low-latency voice AI agents by handling the real-time infrastructure so you can focus on your agent’s logic; it manages WebSockets, voice activity detection, global edge deployment, and voice model integrations while giving you full control over how your agent thinks, speaks, and responds. It enables natural, fluid voice conversations with sub-second response times and human-like turn-taking, offers observability tools so you can inspect calls, latency, and failures in production, and fits naturally into modern TypeScript and Next.js stacks with simple CLI and SDK support so you can receive text and send text back. With Layercode, you can avoid vendor lock-in by hot-swapping leading voice and transcription model providers, maintain complete flexibility by plugging in your own AI agent backend, and deploy voice agents across web, mobile, and phone interfaces.
|
|||||
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
Platforms Supported
Windows
Mac
Linux
Cloud
On-Premises
iPhone
iPad
Android
Chromebook
|
|||||
Audience
Researchers looking for a solution to build real-time, multimodal AI applications that require low-latency voice and video interactions
|
Audience
Software developers building real-time conversational AI and voice-enabled applications who need dependable cloud infrastructure to deploy low-latency, production-ready voice AI agents
|
|||||
Support
Phone Support
24/7 Live Support
Online
|
Support
Phone Support
24/7 Live Support
Online
|
|||||
API
Offers API
|
API
Offers API
|
|||||
Screenshots and Videos |
Screenshots and Videos |
|||||
Pricing
No information available.
Free Version
Free Trial
|
Pricing
$0.04 per minute
Free Version
Free Trial
|
|||||
Reviews/
|
Reviews/
|
|||||
Training
Documentation
Webinars
Live Online
In Person
|
Training
Documentation
Webinars
Live Online
In Person
|
|||||
Company InformationGoogle
Founded: 1998
United States
ai.google.dev/gemini-api/docs/live
|
Company InformationLayercode
United Kingdom
layercode.com
|
|||||
Alternatives |
Alternatives |
|||||
|
|
||||||
|
|
|
|||||
|
|
|
|||||
|
|
|
|||||
Categories |
Categories |
|||||
Integrations
Gemini
Cartesia Sonic
Claude
Daily
ElevenLabs
Gemini Enterprise
Go
Google AI Studio
LangChain
Nano Banana
|
Integrations
Gemini
Cartesia Sonic
Claude
Daily
ElevenLabs
Gemini Enterprise
Go
Google AI Studio
LangChain
Nano Banana
|
|||||
|
|
|