Showing 30 open source projects for "project"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Buster

    Buster

    Captcha solver extension for humans

    Save time by asking Buster to solve captchas for you. Buster is a Firefox extension which helps you to solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking on the extension button at the bottom of the reCAPTCHA widget. It is not guaranteed that challenges are always solved, the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome...
    Downloads: 38 This Week
    Last Update:
    See Project
  • 2
    WhisperKit

    WhisperKit

    On-device Speech Recognition for Apple Silicon

    ...We founded Argmax in November 2023 to empower developers and enterprises everywhere to deploy commercial-scale inference workloads on user devices. The fast-growing need for Whisper inference in production convinced us to take it on as our first project.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Kaldi

    Kaldi

    kaldi-asr/kaldi is the official location of the Kaldi project

    Kaldi is an open source toolkit for speech recognition research. It provides a powerful framework for building state-of-the-art automatic speech recognition (ASR) systems, with support for deep neural networks, Gaussian mixture models, hidden Markov models, and other advanced techniques. The toolkit is widely used in both academia and industry due to its flexibility, extensibility, and strong community support. Kaldi is designed for researchers who need a highly customizable environment to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    FireRedASR

    FireRedASR

    Open-source industrial-grade ASR models

    FireRedASR is an industrial-grade family of open-source automatic speech recognition models designed to provide high-precision speech-to-text performance across languages including Mandarin, English, and various Chinese dialects, achieving new state-of-the-art benchmarks on public test sets. The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. FireRedASR not only excels in traditional speech recognition tasks but also demonstrates strong capability in challenging scenarios like singing lyrics recognition, where accurate transcription is often difficult for conventional models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Omnilingual ASR

    Omnilingual ASR

    Omnilingual ASR Open-Source Multilingual SpeechRecognition

    Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new language sets. It emphasizes modularity: acoustic modeling, language modeling, tokenization, and decoding are separable pieces you can swap or ablate. The repo is aimed at pushing practical multilingual ASR—robust to accents, code-switching, and domain shifts—rather than language-by-language systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    A project which uses existing speech recognition and speech translation resources to build conversation partners for beginning language students, based on the idea of a "translation game".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Wilson Personal Assistant

    Wilson Personal Assistant

    Personal Assistant using speech recognition and speech synthesis

    ...This will allow the bot to answer any question. 3rd stage is the personal assistant. Calendar , email , finance, organization management. Media control, device & file management. The project will be kept modular. AIMLbot source: https://sourceforge.net/projects/aimlbot/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    kisrecord

    A capturing tool for speech data as input for SRS acoustic models

    This project follows the target for capturing recordings in combination with the written representation. For the recordings microphones of the kinect are used. The audio files and connected spoken text enable the training of acoustic models for speech recognition. The background is to realize crowd guided open source acoustic models for speech recognition projects like sphinx.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    VoR ~ Voice Recognition

    VoR ~ Voice Recognition

    VoR is a voice recognition tool for Android.

    It is an Open-Source project that acquire the word spoken and open the its page in Wikipedia. It include a VoR Engine ~ Voice Recognition Engine make by CMU Sphinx tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    HMM Speech Recognition in Matlab

    A speech recognition system using Matlab/Simulink/Stateflow.

    This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    High-order HMM in Java

    A duration high-order hidden Markov model (DHO-HMM) in Java.

    This project provides an implementation of duration high-order hidden Markov model (DHO-HMM) in Java. It is compactible with JDK 5 & 6. It was used in the author's research on speech recognition of Mandarin digits. There are some Chinese words in this project and I am afraid that I don't have enough time to translate to English recently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Domotic Speech-recognition interface

    Speech-recognition interface for a domotic system.

    This product recognizes oral commands and translates them to domotic orders for a domotic system. This product does not implement a domotic system. This product is an interface to be plugged to a domotic system. The speech recognition is done by an arduino UNO board and an EasyVR shield. Available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific users. For this purpose 2 interfaces are provided: a command line...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    KinectCAD

    KinectCAD

    Gesture based movement with CATIA

    This project provides a gesture based movement of part objects in the CAD-system CATIA. It is possible to rotate, move or zoom in or out. Further there is a rudimentary speech recognition to change the rotating axes or to do some other helpful things. KinectCAD has been written in Visual C# 2010. The package includes the source code and binaries files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This project is being developed to be a Java based speech recognition (SR) program. In addition to the SR program itself, it includes a program which allows a user to view the sound being received by the computer. The user can manipulate this data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Interactive4J
    Project aim to provide simple easy APIs for Java developers to use interactive abilities in their Java Applications like speech recognition, handwriting recognition, use of web cam , sound record/play, decision trees , text to speech and many others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This project, npp (net plus plus, net++), is developed on top of open source package QuickNet for Neural Network training in speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Speech-recognition engine
    It's a student project for SUPINFO .Net labs. It consist to develop a speech-recognition engine for a few words with a database of .vocal stamps.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Graphical User Interface and advanced facilities for training the speech recognition system Sphinx-III (using SphinxTrain).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A voice recognition program for World of Warcraft. It allows for key presses and macros to be mapped to voice commands. It also allows voice control of in-game chat channels. This project has moved! Search "WoW VoiceBox" on codeplex.com.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A learning package for children, helping them to learn a foreign language. Techniques such as speech recognition will be used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Project VALMIKI is a speech recognition engine done from scratch. The engine will mostly be language independent, though, the developers will primarily focus on Nepali language as their chief target.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Written in VB 6 for Win98 and up. Our goal is to provide speech recognition and text to speech unlike any software currently in the market. Some features include TTS, Dictation using Microsoft SAPI 5.1 engines. Visit our Home Page for more info.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3d facial expressions, speech synthesizer, speech recognition and an iq-test solving functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB