Showing 18 open source projects for "audio development"

View related business solutions
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    JSpeech

    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Eagle Mode

    Eagle Mode

    Zoomable user interface (ZUI) with file manager, file viewers and more

    Eagle Mode is a zoomable user interface (ZUI) with file manager, file viewers, games, and more. This project is about a futuristic style of human-computer interaction, where the user can visit almost everything simply by zooming in. It features a portable C++ toolkit API for developing such ZUIs, and it features the Eagle Mode application, which provides a virtual cosmos with plugin applications. The most important one is the professional file manager with its rich, scriptable command...
    Leader badge
    Downloads: 90 This Week
    Last Update:
    See Project
  • 3
    slab3d

    slab3d

    slab3d is a real-time virtual acoustic environment.

    slab3d is a real-time virtual acoustic environment rendering system. It includes two rendering applications, libraries for spatial auditory display and audio signal processing development, and a set of tools for HRTF analysis and visualization.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Regulus is a Prolog-based toolkit for building spoken dialogue systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 5

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Cotovía

    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the avaliable speaker voices has its own license. The speakers available at sourceforge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is avaliable through external links (see the Blog section). Cotovia has been developed by the University de Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 8
    Framework SRM (Sound Recognizer ME) written in Java Micro Edition capable of recognize abstract sounds and isolated words announcer dependent on mobile devices.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 10
    SkyMonome facilitates Monome hardware programming in Java by manipulating simple virtual entities like groups of leds, sequencers, etc... You don't need to know the OSC protocol at all. SkyMonome uses JavaOSC library and needs only Java 5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The Carolina Computer Assistive Technolog group at UNC-Chapel Hill focuses on the development of enabling technologies. We are currently maintaining the pyTTS package here, but our older work is still available for (unsupported) use.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    HoloMenu is a C library for creating GUIs with holographic simulator view. It helps creating buttons, text boxes and other components with effects similar to Syndicate Wars. It is designed to use as menu in games, but may be used for other projects.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Free Open Source VoiceXML editor programmed in Java (Swing). The VoiceXML document is regularly parsed, a tree view is built and syntax errors are reported in a specific table.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    VxmlSurfer is a modular, multi-platform, extensible VoiceXML 2.0 interpreter, geared to Web browsing as opposed to telephony. VxmlSurfer is currently used as a user interface for the HearSay non-visual Web browser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    The Semantic Time Framework is a multimedia "meta-framework" that provides more high-level abstractions of time. STF enables developers to more quickly and easily develop interactive applications with time-based media.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    VoxForge collects user-submitted speech audio files for the creation of Acoustic Models for Free and Open Source Speech Recognition Engines such as HTK, Julius, ISIP and Sphinx.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    TacitPixel is a scalable C++ framework for building application prototypes for interactive collaborative VR/AR applications. It provides a number of features that make prototyping fast and portable. TacitPixel is the successor of the TAP and Twisted Pair engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Convert copy. Pipes existing converter tools together to convert files from a format to another. E.g. add the converters 'pdf -> text' and 'text -> wav' and you can convert pdf to wav. Automatically installs missing converters in Debian.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB