20 projects for "audio test" with 1 filter applied:

  • 1
    FFmpegCommand

    Command library suitable for Android. It implements audio and video

    FFmpegCommand is a graphical utility designed to simplify the generation and execution of FFmpeg commands for multimedia processing tasks. It provides an interface where users can configure parameters such as codecs, bitrates, and formats without manually writing command-line instructions. The tool dynamically builds FFmpeg commands based on user selections, making complex workflows more accessible. It supports common operations such as transcoding, trimming, and format conversion....
    Downloads: 1 This Week
  • 2
    NanoBoyAdvance

    A cycle-accurate Nintendo Game Boy Advance emulator

    ...It is designed to emulate the GBA at a very low level, including CPU timing, DMA operations, graphics processing, and memory behavior, ensuring that even edge cases and obscure hardware quirks are faithfully reproduced. The emulator achieves extremely high compatibility, passing multiple hardware test suites and accurately running games that rely on precise timing conditions. In addition to accuracy, it introduces enhancements such as a high-quality audio mixer that improves sound output without altering internal emulation behavior. The project also includes modern usability features like save states, controller remapping, and support for loading ROMs from compressed archives.
    Downloads: 4 This Week
  • 3
    gTTS

    Python library and CLI tool to interface with Google Translate

    ...A small CLI utility, gtts-cli, makes it easy to test or batch-generate MP3 files right from the shell.
    Downloads: 3 This Week
  • 4
    Multimodal

    TorchMultimodal is a PyTorch library

    ...The repository also includes example scripts and datasets for common multimodal tasks (e.g. retrieval, visual question answering, grounding) so you can test and compare models end to end. Installation supports both CPU and CUDA, and the codebase is versioned, tested, and maintained.
    Downloads: 0 This Week
  • 5
    SOFA Toolbox

    Spatially Oriented Format for Acoustics

    This project provided the SOFA Toolbox, a toolbox for the Spatially Oriented Format for Acoustics (SOFA). The SOFA Toolbox is the successor of the SOFA API for Matlab/Octave. More information about SOFA can be found at http://sofaconventions.org. In July 2025, we left SourceForge: SOFA Toolbox 2.2.0 is the last release published here. All further releases can be found in the SONICOM Ecosystem: https://ecosystem.sonicom.eu/tools/ The code development happens on GitHub:...
    Downloads: 15 This Week
  • 6
    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 0 This Week
  • 7
    ufonet

    UFONet - Denial of Service Toolkit

    UFONet is a set of hacktivist tools that allow launching coordinated DDoS and DoS attacks and combining both in a single offensive. It also works as an encrypted DarkNET to publish and receive content by creating a global client/server network based on a direct-connect P2P architecture. FAQ: https://ufonet.03c8.net/FAQ.html. Release: UFONet-v1.8 [DPh] "DarK-PhAnT0m!" (.zip), md5 = c8ab016f6370c8391e2e6f9a7cbe990a; UFONet-v1.8...
    Downloads: 9 This Week
  • 8

    pydatascope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually.
    Downloads: 0 This Week
  • 9
    VibroToolbox is a tool for research and education in vibrometry. Test your audio equipment, describe responses to vibratory stimulation and analyze directionality. It works with precision i/o hardware or simple sound cards, various sensors and speakers.
    Downloads: 0 This Week
  • 10
    ILIAS LMS
    ILIAS is a web-based learning management system (LMS, VLE). Features: courses, SCORM 1.2 and 2004, mail, forum, chat, groups, podcast, file sharing, authoring, CMS, test, wiki, personal desktop, LOM, LDAP, and role-based access. See http://www.ilias.de/
    Downloads: 7 This Week
  • 11

    pyscope

    Software oscilloscope using Python and tkinter

    Software oscilloscope using Python and tkinter. Supports multiple sources: socket, file, audio, USB. Displays data by samples, time or frequency. Scales the input automatically or manually. It has been renamed "pydatascope" to avoid a name clash with Pyscope, a scoping package on PyPI. See https://sourceforge.net/p/pydatascope/ for the latest code.
    Downloads: 0 This Week
  • 12

    MP3 Dynamic Range Analyzer

    Tests dynamic range of mp3 files

    MP3 dynamic range analyzer: test the dynamic range of your MP3 files.
    Downloads: 0 This Week
  • 13

    Zwicker s Loudness Calculation SW +Tool

    Zwicker's Loudness Calculation (ISO532B) Tools

    Zwicker's Loudness Calculation (ISO532B) Tools. The project includes a loudness calculation program based on Zwicker's Method, and a WAV file analyzer generating data files for loudness calculation. Documentation and test files are also provided.
    Downloads: 0 This Week
  • 14
    DTMF detector library and/or application that reads the specified audio file and reports whether DTMF was detected and/or a list of the detected digits. May have GUI and command-line interfaces.
    Downloads: 0 This Week
  • 15
    Scalable Language API (SLAPI): the most comprehensive architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.
    Downloads: 0 This Week
  • 16
    Alemus: Java software for learning to read sheet music (piano-based). Graphical interface. Several scope options plus 3 modes of use.
    Downloads: 0 This Week
  • 17
    An MP3 cataloging and tagging utility program. This project was created more as an exercise in test-driving a Java GUI than in the interest of completing the program. Perhaps that's why the code was never completed.
    Downloads: 0 This Week
  • 18
    OpenAI Realtime Console

    React app for inspecting, building and debugging with the Realtime API

    ...The Realtime API enables low-latency, interactive communication with language models, supporting use cases such as live conversations, real-time transcription, and interactive applications. This console serves as a reference implementation, showing how to establish WebRTC or WebSocket connections, send audio or text inputs, and receive model outputs in real time. It is built as a simple frontend that developers can run locally to test and understand how Realtime API interactions work. The project is intended as an educational and debugging resource rather than a production-ready application. By offering clear examples of streaming inputs and outputs, the console helps developers accelerate prototyping of real-time AI-powered applications.
    Downloads: 0 This Week
  • 19
    wav2vec2-large-xlsr-53-portuguese

    Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input

    wav2vec2-large-xlsr-53-portuguese is an automatic speech recognition (ASR) model fine-tuned on Portuguese using the Common Voice 6.1 dataset. It is based on Facebook’s wav2vec2-large-xlsr-53, a multilingual self-supervised learning model, and is optimized to transcribe Portuguese speech sampled at 16kHz. The model performs well without a language model, though adding one can improve word error rate (WER) and character error rate (CER). It achieves a WER of 11.3% (or 9.01% with LM) on Common...
    Downloads: 0 This Week
  • 20
    wav2vec2-large-xlsr-53-russian

    Russian ASR model fine-tuned on Common Voice and CSS10 datasets

    wav2vec2-large-xlsr-53-russian is a fine-tuned automatic speech recognition (ASR) model based on Facebook’s wav2vec2-large-xlsr-53 and optimized for Russian. It was trained using Mozilla’s Common Voice 6.1 and CSS10 datasets to recognize Russian speech with high accuracy. The model operates best with audio sampled at 16kHz and can transcribe Russian speech directly without a language model. It achieves a Word Error Rate (WER) of 13.3% and Character Error Rate (CER) of 2.88% on the Common Voice test set, with even better results when used with a language model. The model supports both PyTorch and JAX and is compatible with the Hugging Face Transformers and HuggingSound libraries. ...
    Downloads: 0 This Week