Showing 42 open source projects for "audio engineering"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 1
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP, GraphQL protocols with TLS. Intuitive design pattern for high-performance microservices. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    JSpeech

    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Eagle Mode

    Eagle Mode

    Zoomable user interface (ZUI) with file manager, file viewers and more

    Eagle Mode is a zoomable user interface (ZUI) with file manager, file viewers, games, and more. This project is about a futuristic style of human-computer interaction, where the user can visit almost everything simply by zooming in. It features a portable C++ toolkit API for developing such ZUIs, and it features the Eagle Mode application, which provides a virtual cosmos with plugin applications. The most important one is the professional file manager with its rich, scriptable command...
    Leader badge
    Downloads: 55 This Week
    Last Update:
    See Project
  • 4

    DuranDuranbot

    Teachable/trainable artificially intelligent music bot

    A teachable/trainable artificially intelligent music bot fundamentally inspired by how the new wave band Duran Duran composes music. This program utilizes many algorithmic/AI techniques/processes, including machine learning; which allow you to teach/train it to compose music which you prefer... and the technique which is the foundation of the design of DuranDuranbot, which was directly inspired by how Duran Duran writes music........ Called, "bit by bit circular composition"....... and it's...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 5
    Mesh2HRTF
    Mesh2HRTF is an open-source project aiming at providing an easy-to-use software package for the numerical calculation of HRTFs. It targets researchers in the field of binaural audio. In a nutshell, Mesh2HRTF simply reads geometrical data, calculates the corresponding sound field and outputs HRTFs. To support multiple computer platforms, the concept of Mesh2HRTF is to focus on a command-line tool, which forms the numerical core, i.e., an implementation of the 3-dimensional Burton-Miller...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Tensor2Tensor

    Tensor2Tensor

    Library of deep learning models and datasets

    Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Regulus is a Prolog-based toolkit for building spoken dialogue systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    vinylsimtool

    vinylsimtool

    vinylsimtool: simulation of vinyl record playback

    vinylsimtool: Simulation of audio artifacts of vinyl record playback systems, based on physical modelling. --- vinylsimtool: Simple command-line tool for WAV file processing. Copyright (c) 2017 by Klaus Michael Indlekofer. All rights reserved. --- Vinylsimtool-Plugin-VST: Real-time plug-in. Copyright (c) 2017-2019 by Klaus Michael Indlekofer and Sebastian Marcus Edelmann. All rights reserved. Note: Special restrictions apply. See disclaimers below and within the distribution. (We are...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Wheefun Computer Prototyping Kit

    A Toolkit for Designing Computers

    This package is designed for people who are a) interested in writing emulators or b) integrating this level of detain into their applications (e.g., a video game). The ability to do this is useful because a) it allows for tinkering far before physical implementation of the design is. In addition to a strong core, WFCPK will also include modules emulating various processors (e.g., the MOS 6502 and the Zilog Z80) as well as the Video-Audio Interface (a custom VGA-compatible display and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    CoRoPa stands for Computational Rough Paths. The aim of CoRoPa is to provide a software framework for various ideas related to Rough Path Theory, including rough differential equations and the digital description of serial data streams.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    GTK+ IOStream

    GTK+ IOStream

    GTK+iostream, Data plots, ORB, Neural Networks, WSOLA

    Create GTK+ interfaces using IOStream style code. Its simple and quick! Also providing simple data plotting (similar to Matlab/Octave), ORB (Object Request Broker), and Neural Network computations. You can create GTK+ GUIs in a few lines of code. Labels<<"Thanks for reading"; (HBox<<Labels).show(); or even one line : (HBox() << (Labels()<<"Thanks for reading")).show(); Inline code destructs the classes, but leaves the widgets/callbacks operating. This is great for embedded...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SuperCollider

    SuperCollider

    Real-time audio synthesis engine and music programming language

    N.B. SuperCollider is now hosted at GitHub, and the most recent versions can be downloaded from there. http://supercollider.github.io A real time audio synthesis engine, and an object-oriented programming language specialised for music. SuperCollider was created in 1996 by James McCartney, now (since version 3) released as free software.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    ArduinoRCLib

    Library for Arduino based R/C equipment

    A library containing functions for building Arduino based R/C equipment like transmitters, receivers and flight controllers.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    TimeDoctor
    TimeDoctor is a tool to visualize execution traces of tasks, queues, cache behavior, etc. While originally targeting embedded media processors and includes specific features for analyzing audio/video streaming applications it has wider applicability.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    AVRS performs spatial 3D-sound processing allowing the arbitrary placement of sound source in auditory space. The main goal is to develop a genuine experimental flexible platform, that runs in a general-purpose architecture computer (standard hardware and software).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19

    falabaac

    AAC encoder, C source code,support Linux and Windows

    A new AAC encoder(different with FAAC, nero aac or qtaac), very good quality. support Linux and Windows platform. Support 6 speed level, default is 3. level 3 can supply good quality of the AAC file, if you want the best quality, use level 1, but the speed is slow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Interactive4J
    Project aim to provide simple easy APIs for Java developers to use interactive abilities in their Java Applications like speech recognition, handwriting recognition, use of web cam , sound record/play, decision trees , text to speech and many others.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    CAMEL - A Framework for Audio Analysis
    CAMEL (Content-based Audio and Music Extraction Library) is an easy-to-use C++ framework developed for content-based audio and music analysis. The framework provides a set of tools for easy Segmentation, Feature Extraction, Domain Extraction, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SkyMonome facilitates Monome hardware programming in Java by manipulating simple virtual entities like groups of leds, sequencers, etc... You don't need to know the OSC protocol at all. SkyMonome uses JavaOSC library and needs only Java 5.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    CthughaNix is the reincarnation of Cthugha-L, a Linux port of one of the first audio visualization software packages, known as "An Oscilloscope On Acid".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This project intends to provide a total solution for PMP (Portable Multimedia Player), which uses MiniGUI as the embedded graphics platform. It is designed for embedded Linux system, and provides a complete PMP GUI solution and main applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo