Showing 68 open source projects for "audio testing"

  • 1
    SimpleX

    The first messaging platform operating without user identifiers

    Other apps have user IDs: Signal, Matrix, Session, Briar, Jami, Cwtch, etc. SimpleX does not, not even random numbers. This radically improves your privacy. The video shows how you connect to your friend via their 1-time QR-code, in person or via a video link. You can also connect by sharing an invitation link. Temporary anonymous pairwise identifiers: SimpleX uses temporary anonymous pairwise addresses and credentials for each user contact or group member. It allows to deliver messages...
    Downloads: 27 This Week
  • 2
    CloakBrowser

    Stealth Chromium that passes every bot detection test

    ...Unlike traditional browser automation tools that rely primarily on injected JavaScript patches, CloakBrowser applies source-level Chromium modifications affecting WebGL, canvas rendering, audio fingerprints, fonts, GPU reporting, WebRTC behavior, and automation detection signals. The project integrates with Playwright and Puppeteer while preserving familiar automation workflows for developers. It also supports isolated browser profiles with configurable fingerprints, making it useful for testing, automation research, scraping, QA, and multi-profile browser environments. ...
    Downloads: 51 This Week
  • 3
    ComfyUI

    The most powerful and modular diffusion model GUI, api and backend

    The most powerful and modular diffusion model GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...
    Downloads: 222 This Week
  • 4
    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    Spring AI Alibaba Examples provides a collection of example projects that demonstrate how to use Spring AI and Spring AI Alibaba across different scenarios, from basic setups to more advanced AI applications. It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph...
    Downloads: 0 This Week
  • 5
    OpenAI .NET

    The official .NET library for the OpenAI API

    OpenAI .NET is the official client library for calling the OpenAI REST API from C# and other .NET languages, with first-class support for modern .NET patterns. It provides strongly typed clients across API areas (chat, audio, images, embeddings, moderations, batches, files, models, vector stores, responses, realtime, assistants) and works with .NET Standard 2.0 while the examples use .NET 8. You install it via NuGet and authenticate with an API key, ideally through environment variables or...
    Downloads: 0 This Week
  • 6
    Sapiens

    High-resolution models for human tasks

    Sapiens is a research framework from Meta AI focused on embodied intelligence and human-like multimodal learning, aiming to train agents that can perceive, reason, and act in complex environments. It integrates sensory inputs such as vision, audio, and proprioception into a unified learning architecture that allows agents to understand and adapt to their surroundings dynamically. The project emphasizes long-horizon reasoning and cross-modal grounding—connecting language, perception, and action into a single agentic model capable of following abstract goals. It includes simulation environments, datasets, and benchmarks for testing grounded understanding, imitation learning, and decision-making. ...
    Downloads: 0 This Week
  • 7
    PaddleSpeech

    Easy-to-use Speech Toolkit including Self-Supervised Learning model

    PaddleSpeech is an open-source toolkit on the PaddlePaddle platform for a variety of critical tasks in speech and audio, with state-of-the-art and influential models. Via the easy-to-use, efficient, flexible and scalable implementation, our vision is to empower both industrial application and academic research, including training, inference & testing modules, and deployment process. With a low barrier to installation, a CLI, Server, and Streaming Server are available to quick-start your journey. ...
    Downloads: 0 This Week
  • 8
    Expo Orbit

    Accelerate your development workflow with one-click build launches

    Expo Orbit is a desktop application developed by the Expo team to streamline the development workflow for React Native and Expo projects. It offers a user-friendly interface that allows developers to manage simulators and devices, install and launch builds, and handle updates with ease. Orbit supports various platforms, including macOS, Windows, and Linux, and integrates seamlessly with Expo Application Services (EAS) to facilitate efficient testing and deployment of applications.
    Downloads: 2 This Week
  • 9
    Matcha-TTS

    A fast TTS architecture with conditional flow matching

    Matcha-TTS is a non-autoregressive neural text-to-speech architecture that uses conditional flow matching to generate speech quickly while maintaining natural quality. It models speech as an ODE-based generative process, and conditional flow matching lets it reach high-quality audio in only a few synthesis steps, which greatly reduces latency compared to score-matching diffusion approaches. The model is fully probabilistic, so it can generate diverse realizations of the same text while still sounding stable and intelligible. The repository provides an end-to-end TTS pipeline: a PyTorch/Lightning training stack, configuration files, pre-trained checkpoints, a command-line interface, and a Gradio app for interactive testing.
    Downloads: 1 This Week
  • 10
    NVIDIA Cosmos

    NVIDIA Cosmos is an open platform of world models, datasets

    ...It includes model checkpoints, curated synthetic datasets, evaluation benchmarks, and code for research and deployment. Cosmos 3 expands the platform with omnimodal world models that can work across language, image, video, audio, and action sequences. Its main value is helping developers create AI systems that reason about physical spaces, predict outcomes, and generate realistic world data for training and testing.
    Downloads: 0 This Week
  • 11
    FFmate

    FFmate is a modern and powerful automation layer

    ...It allows users to perform tasks such as transcoding, trimming, and format conversion without needing to memorize command-line syntax. The tool dynamically generates FFmpeg commands based on user input, making complex workflows more accessible. It supports a wide range of audio and video formats, enabling flexible media processing. FFmate is designed for both beginners and advanced users, offering a balance between simplicity and customization. It can also be used for experimenting with encoding parameters and testing different configurations. Overall, it acts as a visual layer over FFmpeg for easier media manipulation.
    Downloads: 0 This Week
  • 12
    Transcoder

    Hardware-accelerated video transcoding using Android MediaCodec APIs

    Transcoder by DeepMedia is an AI-powered video-to-video speech translation engine that enables fully automated multilingual dubbing. Unlike traditional speech translation systems that rely on multi-stage pipelines, Transcoder directly translates one speaker’s video into another language while preserving facial expressions, lip-sync, and vocal identity. Designed for real-time use and production-grade pipelines, Transcoder combines advanced deep learning models with GPU acceleration to deliver...
    Downloads: 0 This Week
  • 13
    SongRec for Windows (Free Builds Mirror)

    This page provides free access to Windows builds of SongRec.

    🔗 Official Project: SongRec is an open-source application for music recognition. 👉 Official repository: https://github.com/marin-m/SongRec 👉 Official releases: https://github.com/marin-m/SongRec/releases 📦 About These Files: The latest official Windows builds are now provided directly by the original maintainer. This page may host: mirrors of official releases; historical versions for compatibility/testing; previously distributed builds for archival purposes. ⚠️ Important...
    Downloads: 10 This Week
  • 14
    Quimup

    Quimup is a Linux client for MPD

    QUIMUP is a client for the Music Player Daemon (MPD) written in C++ and Qt6. New version 2.1.0 fixes some problems and adds two new features: 1. Album art size can be set (within limits). 2. User actions to run commands or scripts on files or directories. DEB package for Debian (12 or testing) / Kubuntu 24.04. RPM packages for Fedora (40) and openSUSE (Tumbleweed). Tarball for manual installation of the binary. Tarball with source code.
    Downloads: 50 This Week
  • 15
    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 0 This Week
  • 16
    iTester

    This application allows you to test audio devices using the sound card

    This application allows you to test audio devices using your computer's sound card. A stereo line-level audio input is required, and support for sampling rates of 192 kHz or higher is desirable. The impulse tester is designed to study changes that occur in a non-periodic signal when passing through the electronic device being tested. Testing is performed by generating a single pulse, recording the original pulse shape on one channel, recording the modified pulse shape on another channel, and then comparing the shapes of the original and tested signals.
    Downloads: 0 This Week
  • 17
    Memorize

    A Qt vocabulary app 'Memorize' with word management, multiple testing

    ...The software provides a user-friendly interface where learners can add custom vocabulary entries including words, phonetic transcriptions, parts of speech, meanings, and audio pronunciations. Key features include a flashcard system for spaced repetition practice, interactive testing modes to assess learning progress, and detailed statistics with visual charts to track performance over time. The application supports text-to-speech functionality, allowing users to listen to word pronunciations for improved auditory learning.
    Downloads: 1 This Week
  • 18
    Video of Death
    Video of Death is nonlinear video editing software. You must install FFmpeg. 5-9-26: Found a bit of a bug that I'm not going to chase. If there is no visual in the timeline, audio won't play by itself. 5-15-26: Fixed video/audio sync bug. Preview sync isn't perfect, but exported videos have perfect video/audio sync. 5-22-26: Had a major issue on exports that didn't show in testing. Fixed. Also fixed audio sync in preview, as well as source window functions.
    Downloads: 0 This Week
  • 19
    EarQuiz Frequencies

    Software for technical ear training on equalization

    ...The overall training process involves ongoing learning and testing yourself. In the Learn mode, you listen to pink noise or music (or other external audio) excerpts while a 1-octave or 1/3-octave graphic EQ is switched off and on, boosting or cutting frequency bands within certain spectral ranges. Then, in the Test mode, you are given a sequence of 10 similar examples, where you try to guess which frequencies are boosted or cut, and you get scored.
    Downloads: 1 This Week
  • 20
    q4rescue

    A live linux Rescue toolkit/Emergency OS - based on q4os Trinity

    A live Linux system rescue toolkit based on q4os Trinity, available as a bootable ISO for administering, repairing, and cloning/restoring your system and data. Check the wiki for a full description: https://sourceforge.net/p/q4rescue/wiki/ Main tools: -Foxclone -Rescuezilla -Clonezilla -DDrescue-gui -qtfsarchiver -G4L -Apart -Testdisk -Photorec -Boot Repair -WoeUSB -Q4OS imager -UNetbootin -usbimager -Kdirstats -Kdiskmark -Rclone & Rclone...
    Downloads: 71 This Week
  • 21
    DragonOS
    *Until you install the operating system, the default user = live / no password. DragonOS Noble (24.04), DragonOS FocalX (22.04), and DragonOS Focal (20.04) are out-of-the-box Lubuntu-based x86_64 operating systems for anyone interested in software-defined radios. All source-installed software is located in the /usr/src directory, while the remaining software was installed by package managers. What is DragonOS and why do you want it? The shortest distance between two points is a...
    Downloads: 1,210 This Week
  • 22
    Quiz/Survey/Test - QST

    A Free, complete, open source universal assessment/exam platform

    QST, the world's unparalleled open source, multi-tenant, online/LAN assessment software. From a quick quiz on your phone to very large scale, high stakes, proctored desktop testing, we make it easy/secure/economical. Our intuitive design contains features (Immediate detailed results, Create/Export/Import/Convert Questions, WYSIWYG/Math-Chemistry/Basic Editors, Question/Item Bank, Multiple Question Types, Multiple Delivery Styles, Multiple Delivery/Results Options, Adaptive/Branching...
    Downloads: 49 This Week
  • 23
    Pearl Linux 13 (Preslee)

    Complete Audio Workstation on Rolling Release

    For those who want the most current releases of their favorite software. Easily manage your repositories between bookworm, testing, unstable, and experimental with Pearl Sources. Both XFCE and MATE 64-bit Desktop Environments just released. Running Debian Bookworm, Debian's next release, as the base. Enjoy all the standard Pearl Apps including update manager and the software manager with OS X and Windows layouts and theming + many new Pearl Apps. We've included and...
    Downloads: 16 This Week
  • 24

    Objective-Oriented Directivity

    MATLAB toolbox for processing directivity models

    The project is a framework developed in the form of a MATLAB toolbox, which aims to bring a common interface to various directivity representations in acoustics. The legacy version was described in paper 10521 at the 151st Audio Engineering Society Convention (https://arxiv.org/abs/2109.14370). The preprint on the current, improved version can be found here: https://arxiv.org/abs/2206.12283. It is currently not submitted anywhere; please refer to the toolbox by citing this website.
    Downloads: 0 This Week
  • 25

    audio-msg

    Storing audio messages in a chain of URLs.

    For testing purposes, 'pcode' files executable in MATLAB are distributed. MATLAB is needed to record, upload, download, and play the audio messages. Audio frames are hidden in URL strings stored on name servers. Audio frames are linked using the hash of neighbouring frames. The audio file is restored (downloaded) by knowing the last hash.
    Downloads: 0 This Week