Showing 89 open source projects for "speech recognition for hebrew"

  • 1
    FM2TXT

    Listens to radio via RTL-SDR, recognizes speech, and writes a text log

    Log speech from your favorite FM station to a text file using an RTL-SDR dongle and speech recognition. Cross-platform tool. For Windows installation, follow the README on the download page: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ The source is also available on GitHub: https://github.com/randaller/fm2txt To recognize from a sound card instead of an RTL-SDR dongle (which allows transcribing NFM, etc.), see: https://github.com/randaller/souncard2txt
    Downloads: 0 This Week
  • 2
    Lip Reading

    Cross Audio-Visual Recognition using 3D Architectures

    ...This code provides an implementation of Coupled 3D Convolutional Neural Networks for audio-visual matching. Lip-reading is one specific application of this work. Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method for speaker verification in multi-speaker scenarios. AVR systems leverage the information extracted from one modality to improve recognition in the other by filling in the missing information. ...
    Downloads: 2 This Week
  • 3
    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT, or Just Another Voice Transformer (formerly Just Another Video Transcriber), is speech recognition software that also supports text-to-speech and simple media conversion. JAVT converts video files to WAV audio using ffmpeg, then transcribes the audio to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and have JAVT read it aloud through text-to-speech conversion.
    Downloads: 2 This Week
  • 4

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    ...These toolkits are meant to facilitate research and development in automatic distant speech recognition.
    Downloads: 0 This Week
  • 5
    TalkMaths is a speech user interface that extends the speech recognition program Dragon NaturallySpeaking by parsing spoken mathematical expressions into MathML and/or LaTeX. TalkMaths allows the user to create documents in MathML or LaTeX hands-free.
    Downloads: 0 This Week
  • 6

    piffle

    Speech recognition for Ubuntu

    Speech recognition system for Ubuntu that uses Palaver as its codebase and integrates Pocketsphinx instead of the Google speech API. The codebase is the minimal version of Palaver.
    Downloads: 0 This Week
  • 7
    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 1 This Week
  • 8
    RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. Full installation and usage instructions are given at http://sourceforge.net/p/rnnl/wiki/Home/
    Downloads: 0 This Week
  • 9

    Domotic Speech-recognition interface

    Speech-recognition interface for a domotic system.

    This product recognizes spoken commands and translates them into orders for a domotic (home automation) system. It does not implement a domotic system itself; it is an interface to be plugged into one. Speech recognition is performed by an Arduino UNO board with an EasyVR shield. The available spoken commands are generated from a house description file in XML format. The commands have to be trained for a specific user. For this purpose, two interfaces are provided: a command-line interface and a web application. These interfaces let you view the commands, train them, and delete trainings.
    Downloads: 0 This Week
  • 10
    Pythia is a natural language question answering system, which uses Speech Recognition and Text To Speech technologies to communicate with the user.
    Downloads: 0 This Week
  • 11
    Orcus smartHome is a voice/sensor/web-interactive home automation, streaming media, and security monitoring system with voice recognition, speech synthesis, scheduling via Google Calendar, and a web interface.
    Downloads: 0 This Week
  • 12
    Voice Conference Manager (VCM) uses VoiceXML and CCXML to control speech recognition, text to speech, and voice biometrics for a telephone conference service. Say the names or numbers of people and VCM places them into the call. Can be hosted on public servers.
    Downloads: 0 This Week
  • 13
    GoMad is a speech recognition system that lets you control Windows-based applications using your voice as input instead of your mouse and keyboard.
    Downloads: 0 This Week
  • 14
    Provides a voice interface for applications via a plug-in system. Allows voice recognition to be included in an application with minimal effort.
    Downloads: 0 This Week