Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.
Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
This program runs on XP/2000/NT plataform using the Microsoft .NET Framework and Microsoft SAPI speech / voice engine.
Monitors an unlimited number of files on local or remote filesystems , for changes and then speak the content
Portable GUI Ogg/Speex/FLAC audio encoder/player that can encode wave file and provides additional functionality such as audiofile tagging, html-album generator, cd-ripping, etc. Targeted to be used on Freebsd, Linux and win32 platforms and the frontends
The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support using AES to protect data. Please read forums for if interested.
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud
Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
VoiFax is a program that manage voice/data/fax modem in the same manner of vgetty and mgetty.VoiFax is thinking for use in the small businness as in the enterprises. It is fully compatible with vgetty scripts and manage (via efax) modem fax 1.0/1.1/2.0
EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3d facial expressions, speech synthesizer, speech recognition and an iq-test solving functionality.
OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently an User-Dependent Isolated Word Recognizer and can be expanded to include more capability for recognition.
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech and analyze
the auditory scene. It can also synthesize spatialised sounds from
mono recording, edit, analyze via spectrogram, filter and re-sample
signals.
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.
Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
TuxTalk is a software only speech synthesizer toolkit under the GPL. It\'s main goal is to allow the blind a open sourced and maintained SUI for end users at a kernel module leval and a open sourced library for developers who wish to support blind users.
A XFig based rapid prototype yeilding an audio speed alteration tool. This tool lets you arbitrarily alter the speed of audio files. It uses the WSOLA algorithm for audio speed alteration without pitch change.
TransKribe is a very simple, rather unfinished KDE application designed to aid in the task of transcribing audio (speech) recordings. The most important feature are playback control via easily accessible keys and automatic insertion of time-marks.
Talkbox is a program wich makes your computer talk "with" you. It has a AI based on ALICE program C and uses Festvial speech engin along with speechd to produce voice synthisis. You input text by typeing there is no support for voice reconition.
Libmluv is a C/C++ programmers library to provide the
Czech text-to-speech synthesis and should be able to do transcription of Czech text
to string of phonemes.
ViGiL is supposed to be a platform-independent tool for singing students. In it's final version it should be able to analyze a voice recording (read from audio file or microphone) and compare it to a given score according to melody, rhythm and dynamics.
Cowpie is an application for synchronizing character actions with sound files and animation tools like Blender and 3DS Max. Such actions include the phonemes of speech and facial expressions. Cowpie will also coordinate speech for multiple characters.
Zfestival is a graphical interfase to festival command line. With Zfestival you can enter a text to the edit line and listen the speech with festival, also you can browse your text files and then listen it.
Speak Freely for X is a GNOME interface for the popular Speak Freely applications. Telephony over the internet is now available in a user friendly GNOME environment!
VoiceGuard is a Win32 application that listens to what a speakers speaks into his/her mic and decides whether he/she is an authorised user or not. The system must have a sound card installed.
This project implements series of homework assignment from Columbia university course.
The main target is to create a SIP enabled thin audio client for Linux.