Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.
Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
Try It Free
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.
Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
Secure File Transfer for Windows with Cerberus by Redwood
Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.
Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour.
Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab
MediaGlyphs: an international language based on multimedia ideograms. It allows to read, think or type in your own language sentences written with the shared mediaglyphs: a common writing system for the world, Simple Unambiguous Neutral & Universal.
openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.
VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
DAI = Distributed ArtificialIntelligence The projected is intended to be a test bed for AI related concepts and technologies, not necessarily an end user product, though that could change. Some of the modules can be modified for other uses.
To provide basic text-to-speech capability on as many platforms and for as many spoken
languages as possible by formant synthesis from an International Phonetic Alphabet
representation.
DTW is intended to be a Voice in -> Pictures + Text out program written in java using Sphinx from CMU. This is intended to be useful to people who have good oral/visual literacy skills but poor written literacy skills.
A machine translation program designed to accept verbal or text input and provide text or speech synthesized voice translation as output. Makes use of 3 current open-source projects. The source is currently C/C++ and embedded perl.
The Pawn will make it possibly for you to tell the computer exactly what you would like it to do. Fiction. No its reality now. The highly customizable slackware will be the base for Pawn.
Durito: a free (as in speech) application that will manage, display and analyse various kinds of documents in a diversity of environments. Central to Durito's operation will be technologies such as XML and RDF, both cornerstones of the W3C's Semantic Web.
krmp3 is a car mp3 player which currently provides interaction through text to speech synthesis and a keypad. It allows Playlist managment, the ability to rip cd's as you drive and other administrative tasks.
Idi is a voice recognition program intended to help people with physical disabilities to use a keyboard by dictating. It can also be used as a way to remote control your computer or as a nice way to type in your bath.