Automated Attendance System (AAS) uses two modes of authentication:
* Voice Identification System (VIS)
* Fingerprinting Method
The algorithms used for both modes were developed by me and are designed to be efficient and fast.
The Open VXI VoiceXML interpreter is a portable open source library that interprets the VoiceXML dialog markup language. It is designed to serve as a reference for parties interested in understanding how VoiceXML markup might be executed.
A speech profile is created for a person, containing the elementary sounds they utter. Listeners download the profile once. Each audio sample is then encoded against the profile, and the listener regenerates the audio by decoding it with the stored profile.
A language teaching program and library based on C. It includes sound snippets featuring native speakers. You can create, edit and use various lessons and learn via an optional GTK2 interface.
DTW is intended to be a voice-in, pictures-and-text-out program written in Java using Sphinx from CMU. It is meant to be useful to people who have good oral and visual literacy skills but poor written literacy skills.
A plugin for Gaim that interfaces with the popular program Festival. It allows instant messages to be spoken by Festival so you can hear them through your speakers.
Time-of-day service over the telephone using Voicent Gateway, a VoiceXML gateway that is specially designed for voice modems. A free version is available for download at http://www.voicent.com/download. Sample code for interactive telephony applications.
gwavmerger is an interactive memory training tool designed to facilitate the learning of foreign languages. It helps you memorize long passages of text.
The Pawn will make it possible for you to tell the computer exactly what you would like it to do. Fiction? No, it's reality now. The highly customizable Slackware will be the base for Pawn.
Portable GUI Ogg/Speex/FLAC audio encoder/player that can encode wave files and provides additional functionality such as audio file tagging, an HTML album generator, CD ripping, etc. Targeted to be used on FreeBSD, Linux and Win32 platforms and the frontends
The purpose of this project is to provide a biometric security solution by using voice print, fingerprint and/or facial recognition along with a password and/or smart card support, using AES to protect data. Please read the forums if interested.
VoiFax is a program that manages voice/data/fax modems in the same manner as vgetty and mgetty. VoiFax is intended for use in small businesses as well as in enterprises. It is fully compatible with vgetty scripts and manages (via efax) fax modems 1.0/1.1/2.0
OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently a user-dependent isolated word recognizer and can be expanded to include more recognition capabilities.
SAA (SSPLab Audio Analyzer)
It will be able to separate sources, recognize speech and analyze the auditory scene. It can also synthesize spatialised sounds from mono recordings, and edit, analyze via spectrogram, filter and re-sample signals.
TuxTalk is a software-only speech synthesizer toolkit under the GPL. Its main goal is to give blind end users an open-source, maintained SUI at the kernel module level, and to give developers who wish to support blind users an open-source library.
An XFig-based rapid prototype yielding an audio speed alteration tool. This tool lets you arbitrarily alter the speed of audio files. It uses the WSOLA algorithm to alter audio speed without changing pitch.
TransKribe is a very simple, rather unfinished KDE application designed to aid in the task of transcribing audio (speech) recordings. The most important features are playback control via easily accessible keys and automatic insertion of time-marks.
Talkbox is a program which makes your computer talk "with" you. It has an AI based on ALICE program C and uses the Festival speech engine along with speechd to produce voice synthesis. You input text by typing; there is no support for voice recognition.
Libmluv is a C/C++ programmer's library that provides Czech text-to-speech synthesis and should be able to transcribe Czech text into a string of phonemes.
ViGiL is intended to be a platform-independent tool for singing students. In its final version it should be able to analyze a voice recording (read from an audio file or microphone) and compare it to a given score according to melody, rhythm and dynamics.