Showing 26 open source projects for "audio engineering"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 1
    Pure Data Computer Music System
    The Pure Data (short Pd) project on SourceForge unifies the extensions (or externals) written for the Pure Data Graphical Computer Music System written by Miller Puckette with contributions from many others.
    Downloads: 62 This Week
    Last Update:
    See Project
  • 2
    Eagle Mode

    Eagle Mode

    Zoomable user interface (ZUI) with file manager, file viewers and more

    Eagle Mode is a zoomable user interface (ZUI) with file manager, file viewers, games, and more. This project is about a futuristic style of human-computer interaction, where the user can visit almost everything simply by zooming in. It features a portable C++ toolkit API for developing such ZUIs, and it features the Eagle Mode application, which provides a virtual cosmos with plugin applications. The most important one is the professional file manager with its rich, scriptable command...
    Leader badge
    Downloads: 55 This Week
    Last Update:
    See Project
  • 3
    Source code from the Research Institute for Signals, Systems and Computational Intelligence http://fich.unl.edu.ar/sinc
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 4
    slab3d

    slab3d

    slab3d is a real-time virtual acoustic environment.

    slab3d is a real-time virtual acoustic environment rendering system. It includes two rendering applications, libraries for spatial auditory display and audio signal processing development, and a set of tools for HRTF analysis and visualization.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Myo Mapper

    Myo Mapper

    An open-source software to map Myo data into OSC and MIDI messages.

    MyoMapper, an open-source software to rescale and centre and map Myo data into OSC and MIDI messages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Kromophone

    Kromophone

    A color sonification device for navigation and classification

    The Kromophone is a color-to-sound sensory substitution device. It was designed with the tasks of navigation, object, and environmental navigation in mind.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10

    AhoTTS Iparrahotsa

    TTS for Basque Lapurdian dialect

    AhoTTS Iparrahotsa is the TTS developed at the Aholab Signal Processing Laboratory of the University of the Basque Country (UPV/EHU) for the Lapurdian dialect of Basque. This dialect is spoken at the Northern area of the Basque speaking area (French region). This project was funded by the Euroregion Aquitaine-Euskadi under grant EUSKADI-2012-004.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present. The output files are stored in HTK format. The software is developed at Department of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Cotovía

    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the avaliable speaker voices has its own license. The speakers available at sourceforge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is avaliable through external links (see the Blog section). Cotovia has been developed by the University de Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 13
    LeapInto

    LeapInto

    Simplified interface to Leap Motion designed for art and music apps

    LeapInto provides a simplified interface to the Leap Motion hand sensor input device. Multiple hand recognition is simplified to several stable categories and coordinates are normalised. The interface comes two flavours at present, an open broadcast system using the OSC protocol and a plugin for the Csound audio/music programming language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    SMILE = Speech & Music Interpretation by Large Space Extraction openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research developed originally at TUM in the scope of the EU-project SEMAINE, now maintained and supported by audEERING.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    XVoice provides voice control of X applications using IBM's ViaVoice for Linux (free download at their web site). Both user-defined commands and dictation are supported. It can be used to write letters, write code, control netscape, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    A sensory-substitution software to convert in real-time a video stream into an audio stream (audio to video conversion)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Wechselspieler is an interactive motion tracking tool to transform movement information coming from a file or video into sound in realtime using the MIDI or OSC protocol. It features an easy to use GUI for defining/editing the image trigger regions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Kbdz (KeyboardZ) is a realtime events framework, dedicated to audio. It allows to tranform linux event devices to midi controller, to perform transformation (mapping) on the events. It will implement ODL (Osc Discovery and Learn protocol)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    OpenDragon is a collection of external objects for the Max graphical environment from Cycling '74 (http://www.cycling74.com/products/maxmsp.html) that focuses on device interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MFFM MTC - a type II signal filter and C++ heirachy which controls and describes time code in two modes, INTEGER and DIGIT mode. INTEGER mode for math. manip. (+, -, *, /, etc.). DIGIT mode allows manip. of time digits (0:19:59 -> 0:20:09 click on tens)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TacitPixel is a scalable C++ framework for building application prototypes for interactive collaborative VR/AR applications. It provides a number of features that make prototyping fast and portable. TacitPixel is the successor of the TAP and Twisted Pair engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    EBBA is a project aiming to develop an advanced chatbot by combining AIML, 3d facial expressions, speech synthesizer, speech recognition and an iq-test solving functionality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Voice stress analysis tool. Sometimes used as an audio lie detector (ie. used for detecting stress in a person's voice to allow for the determination of truthfulness).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 25
    VoiceGuard is a Win32 application that listens to what a speakers speaks into his/her mic and decides whether he/she is an authorised user or not. The system must have a sound card installed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo