Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198
A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy.
Important Setup Requirements
Before using...
Just Another Speech Recognition and Text to Speech software.
JAVT, or Just Another Voice Transformer (formerly Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT converts video files to WAV audio using ffmpeg, then transcribes the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
Transcription Aid helps you type text from recordings.
This software helps you type text from speech recordings. It has several functions proven to help with this type of work.
However, it is fully manual (aside from auto-completion), so it offers no speech recognition if that is what you are looking for, but it is a great tool for the job.
This project provides gesture-based movement of part objects in the CAD system CATIA. Parts can be rotated, moved, or zoomed in and out. There is also rudimentary speech recognition for changing the rotation axes or performing other helpful actions.
KinectCAD is written in Visual C# 2010. The package includes the source code and binary files. To start KinectCAD, a correctly installed Microsoft Kinect is required. It also helps to have the Kinect SDK installed...
This application takes ABNF code as input and converts it to GRXML. Both formats adhere to the W3C standard for speech recognition grammars.
A system for researching knowledge representation, language parsing, and derivation with a voice recognition front end. The front end, known as Grace, allows additional functionality to be developed to interact with the web, iTunes, etc.
A voice recognition program for World of Warcraft. It allows for key presses and macros to be mapped to voice commands. It also allows voice control of in-game chat channels. This project has moved! Search "WoW VoiceBox" on codeplex.com.
TSSBank is written in C# (.NET 2.0). Its main target group is people with disabilities. The component produces voice and text output (with value/words), plus an experimental voice recognition (VR) system that recognizes with more than 80% accuracy without training.
EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3D facial expressions, speech synthesis, speech recognition, and IQ-test-solving functionality.
The Speech Recognition Media Player is designed to let you browse and play your music and videos using only your voice. Plug a remote microphone into your PC and use it as a remote control. Especially helpful for visually impaired users.