Dialogflow
Dialogflow from Google Cloud is a natural language understanding platform that makes it easy to design and integrate a conversational user interface into your mobile app, web application, device, bot, interactive voice response system, and so on. Using Dialogflow, you can provide new and engaging ways for users to interact with your product. Dialogflow can analyze multiple types of input from your customers, including text or audio inputs (like from a phone or voice recording). It can also respond to your customers in a couple of ways, either through text or with synthetic speech. Dialogflow CX and ES provide virtual agent services for chatbots and contact centers. If you have a contact center that employs human agents, you can use Agent Assist to help your human agents. Agent Assist provides real-time suggestions for human agents while they are in conversations with end-user customers.
Learn more
Gemini Audio
Gemini Audio is a set of advanced real-time audio models built on Gemini's architecture, designed to enable natural, fluid voice interaction and expressive audio generation through simple language prompts. It supports conversational experiences where users can speak, listen, and interact with AI in a seamless loop, combining understanding, reasoning, and response generation in audio form. It is capable of both analyzing and generating audio, allowing applications such as speech-to-text transcription, translation, speaker identification, emotion detection, and detailed audio content analysis. They are optimized for low-latency, real-time use cases, making them suitable for live assistants, voice agents, and interactive systems that require continuous, multi-turn dialogue. Gemini Audio also integrates advanced capabilities like function calling, enabling the model to trigger external tools and incorporate real-time data into responses.
Learn more
Vogent
Vogent is an all-in-one platform for building humanlike, intelligent, and effective voice agents. It offers a highly authentic, low-latency live voice AI capable of making phone calls up to one hour long and executing follow-up tasks. Vogent automates calls in industries such as healthcare, construction, logistics, and travel. The platform provides a custom end-to-end pipeline for transcription, reasoning, and speech, resulting in extremely low latency and humanlike conversations. Vogent's in-house language models have been trained on millions of phone conversations across hundreds of different task types, performing as well as human agents when prompted or fine-tuned with minimal examples. Developers can dispatch thousands of calls with a few lines of code and automate downstream workflows based on outcomes. The platform supports REST and GraphQL APIs, and offers a no-code dashboard for creating agents, uploading knowledge bases, tracking dials, and exporting transcripts.
Learn more
Telnyx
Telnyx is a global communications infrastructure platform that provides voice, messaging, networking, and AI-powered real-time communication capabilities through a fully owned telecom stack. The platform combines carrier-grade networking, programmable identity systems, AI inference, and low-latency communication infrastructure to support real-time conversational AI agents and enterprise communication workflows. Telnyx owns and operates its entire network stack, including physical infrastructure, mobile core systems, edge processing, and AI compute layers, enabling faster performance and lower latency without relying on third-party telecom providers. The platform offers tools such as voice agent builders, speech-to-text, text-to-speech, global phone numbers, AI orchestration, and programmable compliance controls for building intelligent voice and messaging systems.
Learn more