20 projects for "audio processing" with 2 filters applied:

  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 1
    MATLAB Deep Learning Model Hub

    MATLAB Deep Learning Model Hub

    Discover pretrained models for deep learning in MATLAB

    Discover pre-trained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images, the output is the predicted label and score.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    ...Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 5
    GTK+ IOStream

    GTK+ IOStream

    GTK+iostream, Data plots, ORB, Neural Networks, WSOLA

    ...You can also plot like so : Plot figure; figure.plot(x, y, x.size()); Plotting utilises GtkDataBox.sf.net ORB utilises www.zeroc.com Feed forward neural networks with different activation functions. Audio playback/recording and digital signal processing (DSP) - utilising Jack www.jackaudio.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    NASPRO
    NASPRO, acronym for "NASPRO Architecture for Sound PROcessing" is a collection of free and open source sound processing software built around the LV2 plugin standard.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    FxEngine Framework
    The FxEngine is an Open C++ Framework under LGPL license. The FxEngine Framework simplifies the plugin architecture for the data flow processing. It provides a full control to the plugin architecture for applications that require custom solutions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software. Icon
    Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.

    Banks, lending institutions

    Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
    Learn More
  • 10

    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    See TTS demo at: http://rslp.racai.ro/index.php?page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PNP is a program for tutorial purposes in the signal processing area. It allows to set, move and remove poles and zeros of an one dimensional digital filter. The program shows different effects of the filter in the frequency domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Advanced Speech Signal Analysis library provides a structure to handle various file formats and a variety of analysis functions commonly used in speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    esweep is a scriptable audio measurement program which features various signals and signal processing functions. Its main purpose is the measurement of speakers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    libDSP is a C/C++ library of digital signal processing routines, including standard vector operations, digital filtering, and transforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    FreeSP is an infrastructure for programming platform-independend online signal processing applications. Ports to specific platforms (like DSPs and microcontrollers) are made by simply exchanging the freeSP System-module.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Native MacOSX realtime, Altivec powered sound processing application
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Realtime audio synthesis library using the flexible unit generator architechture. Many uses: music, scientific sonification, generic audio processing. Multithread abilities, extensible synth framework. Similar to modular analog synths.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    KinectStreamer

    Streams data from 3D cameras over a network.

    This is an application that streams data from the Microsoft Kinect or cameras like it over a network. The program is Intended to be used in robotics applications where the controller cannot use such cameras directly due to hardware/software limitations--such as lacking usb ports or appropriate drivers--or in situations where the camera is not in close proximity to the device that needs to access it. Given that the controller can accept data from over the network, another embedded controller...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next