An initiative to create something similar to the Windows programs Roger Wilco, TeamSpeak, BattleCom, and Speak Freely, allowing users on different platforms to talk with each other in real time with minimal CPU and bandwidth usage. Voice chat...
Dhvani is a text-to-speech system for Indic languages. The current C implementation for GNU/Linux supports Hindi, Kannada, Marathi, Malayalam, Gujarati, Bengali, Telugu, Panjabi, Tamil, and Oriya.
festival-croatian is Croatian support for the Festival speech synthesis system.
It includes a Croatian lexicon containing 83 entries; a Croatian synthesis module containing the Croatian phoneset, LTS rules, tokenization, utterance, and accents; Croatian support for the MBROLA speech synthesizer; and 2 Czech voices provided by Brailcom, for use until a Croatian Festival voice is completed.
WAV Speech Enhancer can be used to improve the signal-to-noise ratio of poor-quality speech recordings through dynamic expansion, pink noise attenuation, and low-frequency noise (50-60 Hz) suppression.
The Advanced Speech Signal Analysis library provides a structure for handling various file formats, along with a variety of analysis functions commonly used in speech processing.
An Asterisk dialplan application that allows you to use Lia_Phon and Mbrola as a French speech synthesizer.
Webvoice is a text-to-speech CGI program. You can embed a link in an HTML page to send what you want to say as sound. No software is required on the client side. Festival and sox are needed on the server. Webvoice has its own interface (if needed).
The SingIt Lyric Displayer is an XMMS plugin which displays formatted lyrics, including id3v2xx lyrics. It consists of the displayer and an integrated editor which allows one to easily insert time stamps, edit the text, and export & strip HTML.
eSpeak text-to-speech module for Asterisk. This provides the "espeak" dialplan application, which allows you to use the eSpeak TTS Engine as a speech synthesizer in Asterisk.
eXtace is a 3D audio visualization tool (or eye candy, depending on how you look at it). eXtace requires ESD (Esound) as its sound input source. It performs an FFT (fast Fourier transform) on the audio and displays the result in various graphical modes.
Flite text-to-speech module for Asterisk. This provides the "Flite" dialplan application, which allows you to use the Flite TTS Engine as a speech synthesizer in Asterisk.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
The Lazybones project is a server and a collection of clients. It allows one person to voice control multiple machines. Designed for software developers and sys admins, Lazybones speeds up redundant and/or complex tasks.
A language teaching program and library based on C. It includes sound snippets featuring native speakers. You can create, edit and use various lessons and learn via an optional GTK2 interface.
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech, and analyze the auditory scene. It can also synthesize spatialised sounds from mono recordings, edit, analyze via spectrogram, filter, and re-sample signals.
TuxTalk is a software-only speech synthesizer toolkit under the GPL. Its main goal is to give the blind an open source, maintained SUI for end users at the kernel module level, and an open source library for developers who wish to support blind users.
Talkbox is a program which makes your computer talk "with" you. It has an AI based on ALICE program C and uses the Festival speech engine along with speechd to produce voice synthesis. You input text by typing; there is no support for voice recognition.
Cowpie is an application for synchronizing character actions with sound files and animation tools like Blender and 3DS Max. Such actions include the phonemes of speech and facial expressions. Cowpie will also coordinate speech for multiple characters.