Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.
Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
Simple testing tool to generate RTP data packets and send it via netwok interface or save into pcap file. Primarily intended for use with SIPp application to test speech quality with different codecs.
This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
XVoice provides voice control of X applications using IBM's ViaVoice for Linux (free download at their web site). Both user-defined commands and dictation are supported. It can be used to write letters, write code, control netscape, etc.
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.
Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
A Biblia Falada é um software para leitura e estudo da Biblia Sagrada. Muito simples de usar e totalmente acessível para deficientes visuais, traz, além do novo sistema de leitura, os textos completos da edição Revista e Atualizada.
Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
Asterisk Dialplan application, which allows you to use Lia_Phon and Mbrola as a French speech synthesizer. Application du plan de numérotation d'Asterisk, qui permet d'utiliser Lia_Phon et Mbrola comme synthétiseur vocal français sous Asterisk.
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud
Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
This is an application that takes the input of ABNF code and then converts it to GRXML. Both standards adhere to the W3 standard of grammars for speech recognition.
Easily add interactive animated characters to any application. Can be directly used as a C++ library or through COM\ActiveX wrapper. It makes it easy to trigger animations, set emotions, and speak using a SAPI 5 speech synthesiszer. Built on OGRE3D.
MyDSReader is an homebrew for the Nintendo DS that helps visually impaired users:
1. Read documents in digital format (text, word, pdf, DAISY)
2. Take voice annotations
3. Read e-mails and reply/write using recorded voice clips
audacity-extra now provides a sleek dark themed version of the Audacity open source sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.
Linux on Sound is a project to create a transparent console interface over eSpeak (http://espeak.sourceforge.net/), to increase linux console's accessibility.
Webvoice is a text to speech cgi program. You can embed a link in a html page to send things you want to say, via sound. No software is required on the client side. Festival and sox are needed on the server. Webvoice has its own interface (if needed).
A plugin for pidgin that interfaces with the popular program festival. It allows for instant messages to be spoken by festival so you can hear it through your speakers.
A collection of scripts and programs to automatically annotate video/audio for subtitles. Basically relies on a MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) plug-in for detecting human voice in polyphonic recordings.
C# app (using Windows RSS Platform, Microsoft Feeds 2.0) with SysTray icon and pop-up balloons to indicate new feed items. Provides hook for Text-To-Speech, by starting a (configurable) process (like Festival) with each new item as command-line argument.
This is a Java Wrapper for Cepstral.com text-to-speech engine. Cepstral makes very affordable realistic synthetic voices and provides the developers with C++ API's. We have developed a JSAPI compliant Java-to-JNI-to-C++ Wrapper to use with Cepstral TTS.
Voice is a text to speech program with many features. Some of the features include: Reads Text, Rich Text and Word Documents aloud. Custom greeting. Professional document editor. Clipboard monitoring and processing. Good looking animated character.
Speech based User Interface Components Library for Java is a project to create Java controls and applications that can be used not only by literate people but also by non-literates. Speech and visual element with minimal text is used to create components