Showing 49 open source projects for "voice"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. ...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    Leku

    Leku

    Map location picker component for Android

    ...Component library for Android that uses Google Maps and returns a latitude, longitude and an address based on the location picked with the Activity provided. Note that you have the voice_search_extra_language that is used for the language of the voice recognition. Replace it with the allowed voice recognition locale for your language. We encourage you to add these languages to this component, please fork this project and submit new languages with a PR. It's possible to hide or show some of the information shown after selecting a location. Using the bundle parameter LocationPickerActivity. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Manyfold

    Manyfold

    A self-hosted digital asset manager for 3d print files

    ...Instead of forcing users to download native apps or create accounts on closed metaverse services, Manyfold runs entirely in the browser, letting people join 3D spaces with simple links and participate in real time using avatars, voice chat, and object interaction. Users can build or import shared 3D worlds, arrange media, embed content, and design interactive layouts that support presentations, workshops, social events, games, and team gatherings without heavy software installations. The platform emphasizes accessibility and flexibility, making 3D collaboration as easy as sharing a document link while still providing spatial audio, synchronized interactions, and physics-aware environments that feel alive and responsive.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    emid

    emid is a Python program for voice emotion identification experiments.

    emid is a Python program for voice emotion identification experiments. Further info can be found on the project webpage: https://samcarcagno.altervista.org/emid/emid.html
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    OpenAvionics
    OpenAvionics is a hardware/software projet to design low cost instrumentation (air/attitude, navigation, voice, engine) for light and ultra light aircrafts. The display part of the project may be used in aircraft simulation projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    DoSA-3D

    DoSA-3D

    3D open source actuator simulation software

    DoSA-3D is a 3D open source software for magnetic force analysis of actuators and solenoids. Not only individuals but also companies can use the program for free and participate in the development of it themselves. The program environment is developed to be similar to that of product development, so even product developers who have not majored in analysis can easily analyze the magnetic force of actuators. In DoSA-3D, three programs are connected and operated as follows. - DoSA-3D :...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    DoSA-2D

    DoSA-2D

    2D open source actuator simulation software

    DoSA-2D is a two-dimensional open source software for magnetic force analysis of actuators and solenoids. Not only individuals but also companies can use the program for free and participate in the development of it themselves. The program environment is developed to be similar to that of product development, so even product developers who have not majored in analysis can easily analyze the magnetic force of actuators or solenoids. DoSA-2D is responsible for an easy working...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    GPS Datalogger Device Control
    i-Blue 747 / i-Blue 757 / Qstarz BT-Q1000 / i.Trek Z1 / Konet BGL-32 / Holux M-241 / ... control SW (for Java Phones, PalmOS, WinCe (PPC), Java platforms, Windows, Linux, and MacOS). Compatible with most MTK GPS Chipset based loggers.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 9
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    guglinatts-en

    guglinatts-en

    Guglina TTS, special edition: in English (guglinatts-en)

    ...In addition to being an assistive technology tool, voice synthesizers can still have educational and entertainment applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Personal Assistant Macro Maker

    Personal Assistant Macro Maker

    a personal assistant tool that you can teach custom commands to

    ...PAMM is able to record user actions such as clicking and typing and store those actions into a macro which the user can then name as any english phrase. Users can then run PAMM in the background where it will listen to the user's voice requests and perform requested macros.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GoogleTranslate

    GoogleTranslate

    GoogleTranslate

    Google Translate Mac Client. All known issues have been fixed and the user experience has been optimized, but there may still be a few bugs. In the new version, no matter which translation engine you use, it will first call the detection language interface of domestic Google Translate. In this case, the traffic of your proxy node is abnormal, which causes the request to be intercepted by Google, and you need to enter the verification code (you can also use + + to open the debugging...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Vital Sign Simulator

    Vital Sign Simulator

    Patient vital sign simulator for medical emergency training purposes

    The vital sign simulator simulates a patient monitor and is intended for use in medical emergency training simulations. In combination with a (cheap) cpr-manikin, it offers a low-cost alternative to commercial high-tech patient simulation manikins. It is used with a dual monitor system, one monitor with controls for the operator and one providing the vital signs to the trainees. Heart rate, oxygen saturation, etCO2, respiratory rate, blood pressure and various moving ecg-samples can be set...
    Leader badge
    Downloads: 35 This Week
    Last Update:
    See Project
  • 15
    Al-Mintiq: Arabic eSpeak

    Al-Mintiq: Arabic eSpeak

    Arabic voice files for eSpeak system

    Arabic files and voices for eSpeak Text to speech system, المنطيق : ملفات اللغة العربية لبرنامج توليد الكلام من النص إسبيك
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Arduino Multiple Controller for Windows

    Arduino Multiple Controller for Windows

    A Windows app to control Arduino devices via serial port or ethernet

    Features: 1.Controls Arduino via Serial port and Ethernet(web) 2.Has 4 controls: Button,Voice,Command, Home automation (WEB). 3.Personalize themes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    WS1281 decoder

    Decoding of OOK signals transmitted by a weather sensor on 433,82 Mhz

    WS1281DEC is a portable Win32 application that extract and display the data transmitted by WS1281 weather station external sensor, using the sound device of a PC computer and some additional hardware and software. Hardware requirements are RTL-SDR dongle (e4000 or R820T tuner ), external antenna, and a PC. As additional software, it is necessary to have SDR Sharp and Virtual Audio Cable. The whole system was tested on Windows 7 64 bit OS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    ...The auto generated translations are then enriched with human alternation using an easy graphical user interface. Time stamp information may be synched and a subtitle file or a simple textual output may be generated. Additionally it is planned to use google voice tools to also add voice over from these translated text. Finally the subtitle, translated audio (if any) would be muxed with the original video and uploaded.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    skillful robot

    skillful robot

    Adaptive mobile robot controlled by hand movement and sound commands

    Control system for skillful robot : Windows / language C# / libraries used in the project (Emgu CV, System.Speech, System.Threading, System.IO) Adaptive intelligent mobile robot and is controlled by the movement of the hand (via the camera) and by voice commands the control program use new technologies and offer new and cheap way to control remote without extra devices like 'kinect' In hardware part I have adopted simplicity on the idea and the goal was access to a flexible structure easily adjustable so I used meccano pieces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    SAMI

    SAMI

    Home Automation System for A Typical Apartment

    SAMI is an extensible, voice-controlled home automation system which seamlessly controls all of the pieces of your apartment or house, without installation or monthly fees. For more information about how to use her, see the documentation tab!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SITPLUS
    SITPLUS is a free software framework whose main goal is to provide recreational activities for people with multiple disabilities. It offers new forms of interaction based on computer vision, voice and other peripherals.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    gps_chung
    gps chung is an open source GPS algorythm program. It calculates the shortest way path from start to end out of an array of road segments (position, direction) . Source code gps_chung.bi is included as example in a small application game garden_chung (freebasic) lite version of circuit_chung road circuit game with random map generation , home garden edit and gps speech voices.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 24
    Mescribbler Community is a completed electronic medical record, EMR, optimized for handwriting and voice recognition on the Tablet PC. It is currently in use by hundreds of doctors worldwide. Features: billing, progress note, document management, Rx.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ELIA(Eyegaze Language Integration Analysis) supports the analysis of eye-tracking data for studies in language processing. ELIA eases early analysis of data to enable iterative development of experiments in response to spoken language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB