Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud
Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
A series of open source files and programs available to use for developing programs to work with the WowWee Robotics RSMedia Robot. These include a USB serial console, a cross-compiler, a firmware dump program, text-to-speech and source code.
App in java for chatting to a generative A.I. (involving tts and stt)
Java application for chatting to generative AI Llama3.
* The user can speak into the microphone (speechToText), edit the recognized text and send it to the AI.
* The AI responds and the server returns that response in real time, and the sentences converted to audio (textToSpeech), and the application broadcasts them through the speaker.
The application is prepared so that only one user occupies the server's resources, so if the server is busy, in theory it will not let you...
A fork of the navmol (https://sourceforge.net/projects/navmol/)
NavMol with practical improvements, the addition of menus, the support of Mandarin, the addition of the text-to-speech, the implementation of the interrupt function of speech, and the full internationalization of text, easier and more convenient to be used.
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.
You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
This is a Java Wrapper for Cepstral.com text-to-speech engine. Cepstral makes very affordable realistic synthetic voices and provides the developers with C++ API's. We have developed a JSAPI compliant Java-to-JNI-to-C++ Wrapper to use with Cepstral TTS.
A text to speech converter which will be able to read any document(Presently it is reading text and .doc files).The main aim of the project is to make reading an interesting task and assist BLIND people.
Virtual News Reader is a computer desktop application that can convert text (idealy for online news) into Virtual Voice and can be saved on computer. The project is using Java, (JSAPI), FreeTTS (Text-To-Speech synthesis), DJProject, Substance, & other.
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.
Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
This project aims to distribute a facial animation system with speech, developed to brazilian portuguese case. This system is composed by many modules: movement extraction, facial animation and speech, through a text-to-speech system.
Internet Text Radio,designed around freeTTS,connects to a text server and tunes to a channel.The server starts pumping text data for that channel to the client, which converts text to speech, playing back the text as audio,like an internet radio station.
Kathak is a Bangla text to speech synthesizer which can produce the speech from unicode bangla text input. We are developing the system based on Festvox framework.The Festival Speech Synthesis System was used as a base for developing Kathak.
Voxapl is a suite of voice-enabled applications supporting voice recognition and text to speech functions for home automation services. It is written in Java and communicates via xAP and/or xPL messaging.
A graphical, interactive, multimedia-rich text reader integrated with computer text-to-speech whose level of focus, complexity, and detail is adjustable according to the user's needs. Suitable for users with learning disabilities: i.e. dyslexia.