Automatically translates English, French, or German text into German, French, or English and outputs speech in the target language.
All done automagically over the web.
JeSpeak is a Java library that bridges eSpeak, a compact open source software speech synthesizer. JeSpeak uses JNI to make native calls to libespeak.
AGTK is a suite of software components for building tools for annotating linguistic signals,
time-series data which documents any kind of linguistic behavior (e.g. audio, video).
The internal data structures are based on annotation graphs.
Get your database online quickly using configuration, not code. Use this secure, scalable, and proven enterprise technology to publish any relational data or custom process on the internet. Built on MySQL, Java, XML, and XSL, with an XHTML GUI and beta VoiceXML and WML/WAP interfaces.
vox-attendant is a voice-enabled virtual receptionist for telephony, designed to work with VoiceXML. It includes a distribution for the Voxeo Prophecy platform and provides a web-based interface for managing the directory.
Internet Text Radio, designed around FreeTTS, connects to a text server and tunes to a channel. The server starts pumping text data for that channel to the client, which converts the text to speech and plays it back as audio, like an internet radio station.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
A simple program that speaks text aloud. You can type the text or select a file.
Fala is just a frontend to Festival. It's designed for GNOME, but if you have GTK, Python, and Festival, you can run it.
The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
Project c2h - cetacean to human - building Seadragon, a tool for the scientific research of the acoustic communication of cetaceans, supporting the creation, emission, and recognition of underwater whistles. The blog: http://leafyseadragon.blogspot.com/
festival-te synthesizes text in the Telugu language into speech using Festival TTS. The package provides the supporting modules required to use Festival for Telugu, including modules for text/lexical analysis and intonation/duration prediction.
GibPhone is a highly extensible VoIP/IM client for the .NET framework that uses a powerful plugin engine to allow for UI extensions and any call control stack / media payload / transport protocol combination.
The Lazybones project is a server and a collection of clients. It allows one person to voice control multiple machines. Designed for software developers and sys admins, Lazybones speeds up redundant and/or complex tasks.
To provide basic text-to-speech capability on as many platforms and for as many spoken
languages as possible by formant synthesis from an International Phonetic Alphabet
representation.
DAI = Distributed Artificial Intelligence. The project is intended to be a test bed for AI-related concepts and technologies, not necessarily an end-user product, though that could change. Some of the modules can be modified for other uses.
Automated Attendance System (AAS) uses two modes of authentication:
* Voice Identification System (VIS)
* Fingerprinting Method
The algorithms used for both modes were developed by me and are intended to be efficient and fast.
The Open VXI VoiceXML interpreter is a portable open source library that interprets the VoiceXML dialog markup language. It is designed to serve as a reference for parties interested in understanding how VoiceXML markup might be executed.
A speech profile is created for a person, containing the elementary sounds that person utters. Listeners download the profile once. Each audio sample is then encoded based on the profile, and the listener decodes it with the stored profile to regenerate the audio.
Software to fit whole-sentence language models using the principle of maximum entropy. For developers of speech recognizers, text prediction interfaces, OCR, machine translation software.
A language teaching program and library written in C. It includes sound snippets featuring native speakers. You can create, edit, and use various lessons and learn via an optional GTK2 interface.
A subtitle agent, developed in Java, that helps you create, modify, and work with subtitle files easily. It also provides Java packages for developing subtitle agents. Both are free and open source under the GPL license.
A plugin for Gaim that interfaces with the popular program Festival. It lets Festival speak incoming instant messages so you can hear them through your speakers.