Access Google’s most capable multimodal models. Train, test, and deploy AI with 200+ foundation models on one platform.
Vertex AI gives developers access to Gemini 3—Google’s most advanced reasoning and coding model—plus 200+ foundation models including Claude, Llama, and Gemma. Build generative AI apps with Vertex AI Studio, customize with fine-tuning, and deploy to production with enterprise-grade MLOps. New customers get $300 in free credits.
Try Vertex AI Free
Managed MySQL, PostgreSQL, and SQL Databases on Google Cloud
Get back to your application and leave the database to us. Cloud SQL automatically handles backups, replication, and scaling.
Cloud SQL is a fully managed relational database for MySQL, PostgreSQL, and SQL Server. We handle patching, backups, replication, encryption, and failover—so you can focus on your app. Migrate from on-prem or other clouds with free Database Migration Service. IDC found customers achieved 246% ROI. New customers get $300 in credits plus a 30-day free trial.
The SpeakRight Framework is a speech application framework written in Java. SpeakRight applications are fast to create and work on any (VoiceXML) speech platform. Applications are written in Java with full debug and unit testing available.
JeSpeak is a Java library that bridges eSpeak, which is a compact open source software speech synthesizer. JeSpeak uses JNI to make native call to libespeak.
Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
PDF Annot is a piece of software that enables you to add audio and text annotation to a PDF. It uses JPedal SimpleViewer and iText library. Annotations are supported by Adobe'sofficial PDF Reader. Report any bug here: krakosia[at]gmail.com
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.
Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
AGTK is a suite of software components for building tools for annotating linguistic signals,
time-series data which documents any kind of linguistic behavior (e.g. audio, video).
The internal data structures are based on annotation graphs.
vox-attendant is a telephony voice enabled virtual receptionist designed to work with VoiceXML and includes a distribution for the Voxeo Prophecy platform. It provides a web based interface for managing the directory.
Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.
Internet Text Radio,designed around freeTTS,connects to a text server and tunes to a channel.The server starts pumping text data for that channel to the client, which converts text to speech, playing back the text as audio,like an internet radio station.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
Go from Data Warehouse to Data and AI platform with BigQuery
Build, train, and run ML models with simple SQL. Automate data prep, analysis, and predictions with built-in AI assistance from Gemini.
BigQuery is more than a data warehouse—it's an autonomous data-to-AI platform. Use familiar SQL to train ML models, run time-series forecasts, and generate AI-powered insights with native Gemini integration. Built-in agents handle data engineering and data science workflows automatically. Get $300 in free credit, query 1 TB, and store 10 GB free monthly.
The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
Project c2h - cetacean to human - building Seadragon, a tool for the scientific research of the acoustic communication of cetaceans, supporting the creation, emission, and recognition of underwater whistles. The blog: http://leafyseadragon.blogspot.com/
The Lazybones project is a server and a collection of clients. It allows one person to voice control multiple machines. Designed for software developers and sys admins, Lazybones speeds up redundant and/or complex tasks.
A subtitle agent, which developed on Java, helps to create, modify, and operate subtitle files easily. Also providing Java packages for developing subtitle agents conveniently. Moreover, they're free and open-sourced, based on GPL lisence.
DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
Voxapl is a suite of voice-enabled applications supporting voice recognition and text to speech functions for home automation services. It is written in Java and communicates via xAP and/or xPL messaging.
Time of day service over telephone using Voicent Gateway, a VoiceXML gateway that specially designed for voice modems. A Free version is available for download at http://www.voicent.com/download. Sample code for interactive telephony applications.
OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently an User-Dependent Isolated Word Recognizer and can be expanded to include more capability for recognition.
VoIM is a Java based Jabber client that will provide excellent text instant messaging capabilities as well as the ability to send and receive voice messages.
jATLAS is a Java implementation of ATLAS [Architecture and Tools for Linguistic Analysis Systems]. For more information, see http://jatlas.sourceforge.net.
<h3>SpeaR</h3><b>SpeakingRoomplan</b>
<p>
SpeaR, der sprechende Raumplan, liest Ereignisse aus eine DB aus
und gibt entsprechende Meldungen in verschiedenen Räumen aus.</P>
<br>DB <--JDBC--> Server <--JINI--> SpeakingClients