Fast and accurate automatic speech recognition (ASR) for edge devices
moonshine is an open-source automatic speech recognition toolkit optimized for fast and accurate transcription on edge devices and local environments. The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows....
The project provides a ready-to-use interface for the Julius CSR engine, built for a child with a disability who cannot use the keyboard well. It integrates into X11 and Windows.
Find out how you can help: http://simon-listens.org/index.php?support
festival-croatian is Croatian support for the Festival speech synthesis system.
This support includes a Croatian lexicon containing 83 entries; a Croatian synthesis module containing the Croatian phoneset, LTS rules, tokenization, utterance, and accents; Croatian support for the MBROLA speech synthesizer; and 2 Czech voices provided by Brailcom, for use until a Croatian Festival voice is completed.
BladeWareVXML is a portable VoiceXML 2.1 interpreter, an enhanced version of OpenVXI with improved performance, usability, and integration. A commercial version, with documentation, sample code, and support options, is available from the Commetrex website.
...Its main goal is to give blind end users an open-source, maintained SUI at the kernel-module level, and to give developers who wish to support blind users an open-source library.
Talkbox is a program which makes your computer talk "with" you. It has an AI based on ALICE program C and uses the Festival speech engine along with speechd to produce voice synthesis. You input text by typing; there is no support for voice recognition.