Showing 14 open source projects for "audio development"

  • 1
    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
  • 2
    Eagle Mode

    Zoomable user interface (ZUI) with file manager, file viewers and more

    Eagle Mode is a zoomable user interface (ZUI) with file manager, file viewers, games, and more. This project is about a futuristic style of human-computer interaction, where the user can visit almost everything simply by zooming in. It features a portable C++ toolkit API for developing such ZUIs, and it features the Eagle Mode application, which provides a virtual cosmos with plugin applications. The most important one is the professional file manager with its rich, scriptable command...
    Downloads: 90 This Week
  • 3
    Regulus is a Prolog-based toolkit for building spoken dialogue systems.
    Downloads: 0 This Week
  • 4

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation, and echo cancellation. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite-state transducer (WFST) decoder along with training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
  • 5
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
  • 6
    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the available speaker voices has its own license. The speakers available at SourceForge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is available through external links (see the Blog section). Cotovía has been developed by the University of Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...
    Downloads: 25 This Week
  • 7
    Scalable Language API (SLAPI): an architecture for conversational natural-language applications, including speech recognition and synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.
    Downloads: 0 This Week
  • 8
    SkyMonome facilitates Monome hardware programming in Java by manipulating simple virtual entities such as groups of LEDs, sequencers, etc. You don't need to know the OSC protocol at all. SkyMonome uses the JavaOSC library and needs only Java 5.
    Downloads: 0 This Week
  • 9
    HoloMenu is a C library for creating GUIs with a holographic simulator view. It helps create buttons, text boxes, and other components with effects similar to those in Syndicate Wars. It is designed for use as a menu in games, but may be used for other projects.
    Downloads: 0 This Week
  • 10
    Free, open source VoiceXML editor written in Java (Swing). The VoiceXML document is parsed regularly, a tree view is built, and syntax errors are reported in a dedicated table.
    Downloads: 0 This Week
  • 11
    VxmlSurfer is a modular, multi-platform, extensible VoiceXML 2.0 interpreter, geared to Web browsing as opposed to telephony. VxmlSurfer is currently used as a user interface for the HearSay non-visual Web browser.
    Downloads: 0 This Week
  • 12
    VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
    Downloads: 1 This Week
  • 13
    TacitPixel is a scalable C++ framework for building application prototypes for interactive collaborative VR/AR applications. It provides a number of features that make prototyping fast and portable. TacitPixel is the successor of the TAP and Twisted Pair engines.
    Downloads: 0 This Week
  • 14
    Convert copy. Pipes existing converter tools together to convert files from one format to another. E.g., add the converters 'pdf -> text' and 'text -> wav' and you can convert pdf to wav. Automatically installs missing converters in Debian.
    Downloads: 0 This Week