audio test free download

Showing 13 open source projects for "audio test"

Filters: Artificial Intelligence, Linux

1

FastKoko

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

...A built in web UI, API documentation, and debug endpoints for monitoring system status help users explore voices, test requests, and integrate the service into larger systems.

Downloads: 1 This Week

Last Update: 2025-12-13
See Project
2

gTTS

Python library and CLI tool to interface with Google Translate

...A small CLI utility, gtts-cli, makes it easy to test or batch-generate MP3 files right from the shell.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
3

DiscordGo

(Golang) Go bindings for Discord

DiscordGo is a Go package that provides low level bindings to the Discord chat client API. DiscordGo has nearly complete support for all of the Discord API endpoints, websocket interface, and voice interface. If you would like to help the DiscordGo package please use this link to add the official DiscordGo test bot dgo to your server. This provides indispensable help to this project. Construct a new Discord client which can be used to access the variety of Discord API functions and to set...

Downloads: 0 This Week

Last Update: 2025-05-24
See Project
4

CSM (Conversational Speech Model)

A Conversational Speech Generation Model

The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.

Downloads: 7 This Week

Last Update: 2025-03-19
See Project
5

AI File Sorter

Local AI file organization with categorization and rename suggestions

AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...

Downloads: 236 This Week

Last Update: 2026-03-22
See Project
6

OpenAI Web Application

A web application that allows users to interact with OpenAI's models

A web application that allows users to interact with OpenAI's models through a simple and user-friendly interface. This app is for demo purposes, to test the OpenAI API, and may contain issues or bugs. User-friendly interface for making requests to the OpenAI API. Responses are displayed in a chat-like format. Select models (Davinci, Codex, DALL·E, Whisper) based on your needs. Create AI images (DALL·E). Transcribe audio to text (Whisper). Highlight code syntax. Type in the input field and press Enter or click the send button to make a request to the OpenAI API. ...

Downloads: 1 This Week

Last Update: 2023-03-23
See Project
7

wav2letter++

Facebook AI research's automatic speech recognition toolkit

...After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step. wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read by the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by spaces (or tabs).
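As a rough illustration of the list-file layout described above, the sketch below parses one sample per line into four whitespace-separated columns. The listing does not name the columns, so their meanings here (sample ID, audio path, duration, transcription) are assumptions, as are the names `Sample` and `parse_list_file`; the last column is allowed to contain spaces.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Sample:
    sample_id: str      # assumed meaning of column 1
    audio_path: str     # assumed meaning of column 2
    duration: float     # assumed meaning of column 3 (unit not specified here)
    transcription: str  # assumed meaning of column 4; may contain spaces


def parse_list_file(lines: Iterable[str]) -> List[Sample]:
    """Parse lines of the form '<id> <path> <duration> <transcription...>'."""
    samples = []
    for lineno, raw in enumerate(lines, start=1):
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split on runs of spaces or tabs, at most 3 times, so that the
        # fourth column keeps any internal whitespace.
        parts = line.split(None, 3)
        if len(parts) != 4:
            raise ValueError(f"line {lineno}: expected 4 columns, got {len(parts)}")
        sample_id, audio_path, duration, transcription = parts
        samples.append(Sample(sample_id, audio_path, float(duration), transcription))
    return samples
```

In practice each dataset split (test/valid/train) would be read from its own file, for example `parse_list_file(open(path, encoding="utf-8"))` with a hypothetical `path`.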

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
8

Scalable Language API

Scalable Language API (SLAPI): the most comprehensive architecture for conversational natural-language applications, including speech recognition/synthesis, semantics, and machine translation. Used on Android and other mobile app platforms.

Downloads: 0 This Week

Last Update: 2018-01-22
See Project
9

openEAR

openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classifiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.

4 Reviews

Downloads: 9 This Week

Last Update: 2015-08-06
See Project
10

Distributed Artificial Intelligence

DAI = Distributed Artificial Intelligence. The project is intended to be a test bed for AI-related concepts and technologies, not necessarily an end-user product, though that could change. Some of the modules can be modified for other uses.

Downloads: 0 This Week

Last Update: 2016-03-15
See Project
11

OpenAI Realtime Console

React app for inspecting, building and debugging with the Realtime API

...The Realtime API enables low-latency, interactive communication with language models, supporting use cases such as live conversations, real-time transcription, and interactive applications. This console serves as a reference implementation, showing how to establish WebRTC or WebSocket connections, send audio or text inputs, and receive model outputs in real time. It is built as a simple frontend that developers can run locally to test and understand how Realtime API interactions work. The project is intended as an educational and debugging resource rather than a production-ready application. By offering clear examples of streaming inputs and outputs, the console helps developers accelerate prototyping of real-time AI-powered applications.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
12

wav2vec2-large-xlsr-53-portuguese

Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input

wav2vec2-large-xlsr-53-portuguese is an automatic speech recognition (ASR) model fine-tuned on Portuguese using the Common Voice 6.1 dataset. It is based on Facebook’s wav2vec2-large-xlsr-53, a multilingual self-supervised learning model, and is optimized to transcribe Portuguese speech sampled at 16kHz. The model performs well without a language model, though adding one can improve word error rate (WER) and character error rate (CER). It achieves a WER of 11.3% (or 9.01% with LM) on Common...

Downloads: 0 This Week

Last Update: 2025-07-01
See Project
13

wav2vec2-large-xlsr-53-russian

Russian ASR model fine-tuned on Common Voice and CSS10 datasets

wav2vec2-large-xlsr-53-russian is a fine-tuned automatic speech recognition (ASR) model based on Facebook’s wav2vec2-large-xlsr-53 and optimized for Russian. It was trained using Mozilla’s Common Voice 6.1 and CSS10 datasets to recognize Russian speech with high accuracy. The model operates best with audio sampled at 16kHz and can transcribe Russian speech directly without a language model. It achieves a Word Error Rate (WER) of 13.3% and Character Error Rate (CER) of 2.88% on the Common Voice test set, with even better results when used with a language model. The model supports both PyTorch and JAX and is compatible with the Hugging Face Transformers and HuggingSound libraries. ...
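For readers unfamiliar with the metrics quoted above, here is a minimal sketch of how WER and CER are commonly computed: the edit distance between reference and hypothesis, divided by the reference length in words or characters. This is a generic illustration, not the evaluation script used for these models; the function names are hypothetical, and real evaluations usually normalize casing and punctuation first.

```python
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning i reference items into an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: character-level edit distance over reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

With these definitions, a WER of 13.3% corresponds to roughly 13 word-level errors per 100 reference words.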

Downloads: 0 This Week

Last Update: 2025-07-01
See Project