Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
Zanzibar is a complete, standards based IVR. It includes an MRCPv2 Server with ASR and TTS engines as well as an voiceXML interpreter so that you can deploy and run voiceXML applications. It integrates with VOIP PBX’s (like Asterisk) using SIP and RTP.
This project is being developed to be a Java based speech recognition (SR) program. In addition to the SR program itself, it includes a program which allows a user to view the sound being received by the computer. The user can manipulate this data.
An IDE for visually impaired users. It supports compiling and immediate error line focus, automatic code clean-up and not to mention all screen-readers E.G. NVDA. Sorry Linux can't work. Also, does NOT require Java Access Bridge.
Clavier virtuel et synthétiseur vocal pour les personnes ne pouvant plus parler et ayant du mal à utiliser leurs mains. Virtual keyboard and speech synthetiser for people with reduced mobility and unability to speak. In French and english.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
This is an application where one can set the task to be reminded of in future and you will be notified at that time by voice. You have the option of choosing male/female voice too. Besides, you can choose the time in seconds, minutes or hours.
Cairo sets out to provide an enterprise grade, MRCPv2 compliant speech solution utilizing existing open source speech resources such as FreeTTS and Sphinx-4.
FreeTTS is a speech synthesis engine written entirely in the
Java(tm) programming language. FreeTTS was written by the Sun Microsystems Laboratories Speech Team and is based on CMU's
Flite engine. FreeTTS also includes a partial JSAPI 1.0
Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
audacity-extra now provides a sleek dark themed version of the Audacity open source sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.
Speech based User Interface Components Library for Java is a project to create Java controls and applications that can be used not only by literate people but also by non-literates. Speech and visual element with minimal text is used to create components
A text to speech converter which will be able to read any document(Presently it is reading text and .doc files).The main aim of the project is to make reading an interesting task and assist BLIND people.
Free Open Source VoiceXML editor programmed in Java (Swing). The VoiceXML document is regularly parsed, a tree view is built and syntax errors are reported in a specific table.
The SpeakRight Framework is a speech application framework written in Java. SpeakRight applications are fast to create and work on any (VoiceXML) speech platform. Applications are written in Java with full debug and unit testing available.
JeSpeak is a Java library that bridges eSpeak, which is a compact open source software speech synthesizer. JeSpeak uses JNI to make native call to libespeak.
Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
This site is devoted to the collaborative creation of tools, protocols and procedures for field linguistics and language analysis. We are especially interested in tools for annotating or manipulating text, audio and video-based language archives.
AGTK is a suite of software components for building tools for annotating linguistic signals,
time-series data which documents any kind of linguistic behavior (e.g. audio, video).
The internal data structures are based on annotation graphs.
vox-attendant is a telephony voice enabled virtual receptionist designed to work with VoiceXML and includes a distribution for the Voxeo Prophecy platform. It provides a web based interface for managing the directory.
Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.