One platform to build, fine-tune, and deploy ML models. No MLOps team required.
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
Earn up to 16% annual interest with Nexo.
More flexibility. More control.
Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform.
Geographic restrictions, eligibility, and terms apply.
Ssspeak is a framework providing an SSML speech synthesizer wrapper around festival (the synthesizer) and a Firefox extension enabling your browser to read HTML pages to you. Via audible notifications and voice announcements about the state of the page be
Zanzibar is a complete, standards based IVR. It includes an MRCPv2 Server with ASR and TTS engines as well as an voiceXML interpreter so that you can deploy and run voiceXML applications. It integrates with VOIP PBX’s (like Asterisk) using SIP and RTP.
This project is being developed to be a Java based speech recognition (SR) program. In addition to the SR program itself, it includes a program which allows a user to view the sound being received by the computer. The user can manipulate this data.
TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly GUI for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.
Clavier virtuel et synthétiseur vocal pour les personnes ne pouvant plus parler et ayant du mal à utiliser leurs mains. Virtual keyboard and speech synthetiser for people with reduced mobility and unability to speak. In French and english.
Deploy in 115+ regions with the modern database for every enterprise.
MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
A JNI wrapper for pjsip. You can use this wrapper to develop Java applications using the pjsip library. At the moment only the pjsua API is implemented. If you would like to obtain a commercial license, or need customisations, please contact us.
This is a Linux project that acts as a front end to cdparanoia, sox, and ffmpeg with the hope of making it incredibly simple to rip many audiobook cds into one mono, audiobook (m4b) format file for use in audio players capable of playing audiobooks.
An IDE for visually impaired users. It supports compiling and immediate error line focus, automatic code clean-up and not to mention all screen-readers E.G. NVDA. Sorry Linux can't work. Also, does NOT require Java Access Bridge.
Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab
This is a simple DSP simulator for educational purposes: developed as a course supplement of CIT-4617 (Digital Signal Processing) at Islamic University of Technology (IUT). Written in C++.
GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.
This is an application where one can set the task to be reminded of in future and you will be notified at that time by voice. You have the option of choosing male/female voice too. Besides, you can choose the time in seconds, minutes or hours.
1.) Investigation with cosine transform, and anti transform algorithm, with some voice recognition code. 2.) Translator: Croatian, English. 3.) 2D to 3D picture algorithm (principle) and new 2Dto3D video conversion code with AviSynth video scripting
Cairo sets out to provide an enterprise grade, MRCPv2 compliant speech solution utilizing existing open source speech resources such as FreeTTS and Sphinx-4.
Simple testing tool to generate RTP data packets and send it via netwok interface or save into pcap file. Primarily intended for use with SIPp application to test speech quality with different codecs.