sound api free download

Showing 26 open source projects for "sound api"

View related business solutions

Speech Windows Clear Filters & Widen Search

$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...

Downloads: 21 This Week

Last Update: 2026-03-23
See Project
2

Buzz

Transcribe and translate audio offline on your personal computer

Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API. Get linux versions from: - https://flathub.org/apps/io.github.chidiwilliams.Buzz - https://snapcraft.io/buzz Home page of Buzz https://github.com/chidiwilliams/buzz Note for...

1 Review

Downloads: 4,692 This Week

Last Update: 2026-03-14
See Project
3

Simple TTS Reader

A small clipboard reader

Simple TTS Reader is a small utility that reads text from your clipboard using Microsoft Speech API. Whenever you copy any text, the app instantly converts it into spoken words. Select your preferred speech engine from those installed on your system, such as Microsoft Zira, and adjust speed and volume for personalized playback. The application can also be minimized to the system tray. Plus, it is free and comes with an intuitive interface that makes it accessible to everyone.

4 Reviews

Downloads: 95 This Week

Last Update: 2025-10-27
See Project
4

XR3Player

Dominant JavaFX Advanced Media Player

This project is on Github now : https://github.com/goxr3plus/XR3Player

1 Review

Downloads: 1 This Week

Last Update: 2019-11-04
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

Mumble

Low-latency, high quality voice chat for gamers

Mumble is an open source, low-latency, high quality voice chat software primarily intended for use while gaming. It includes game linking, so voice from other players comes from the direction of their characters, and has echo cancellation so the sound from your loudspeakers won't be audible to other players.

169 Reviews

Downloads: 158 This Week

Last Update: 2022-01-22
See Project
6

srt-translator

Subtitle translator from one natural language to other.

Translating subtitles in format SubRip from one natural language to other. It is based on Google Translate without API and therefore without payment. Translator have automatic and manual spell checkers.

Downloads: 11 This Week

Last Update: 2016-07-19
See Project
7

Modular Audio Recognition Framework

MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.

3 Reviews

Downloads: 0 This Week

Last Update: 2015-10-06
See Project
8

Java Speech API

Wrapper for vendors to simplify usage of the Java Speech API (JSR 113). Note that the spec is an untested early access and that there may be changes in the API.

2 Reviews

Downloads: 5 This Week

Last Update: 2014-12-12
See Project
9

Steel TTS

A cross-platform wrapper for common text-to-speech engines in Python

Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...

Downloads: 4 This Week

Last Update: 2016-03-15
See Project
AI-generated apps that pass security review
Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.

Try Retool free
10

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 0 This Week

Last Update: 2013-10-03
See Project
11

Mixed Excitation hts_engine API

Adds mixed excitation to the hts_engine API

Adds mixed excitation to the hts_engine API

Downloads: 1 This Week

Last Update: 2013-05-30
See Project
12

pjsip-jni

A JNI wrapper for pjsip. You can use this wrapper to develop Java applications using the pjsip library. At the moment only the pjsua API is implemented. If you would like to obtain a commercial license, or need customisations, please contact us.

Downloads: 0 This Week

Last Update: 2015-08-06
See Project
13

Arabisc

Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab

1 Review

Downloads: 0 This Week

Last Update: 2013-04-26
See Project
14

flashcards (granule)

GRANULE is a flashcards program based on Leitner cardfile methodology for learning new words. It features long-term memory training capabilities with scheduling, integrated pictures, sound, and full-screen mode.

Downloads: 10 This Week

Last Update: 2012-08-25
See Project
15

Scalable Language API

Scalable Language API (SLAPI) The most comprehensive architecture for conversational natural-language applications including speech recognition/synthesis, semantics, & machine translation. Used on Android & other mobile app platforms.

Downloads: 0 This Week

Last Update: 2018-01-22
See Project
16

Audacity-Extra

dark themed version of free Audacity sound editor

audacity-extra now provides a sleek dark themed version of the Audacity open source sound editor. The project experiments with Audacity variations. There's a vowel-sound target-practice display for language learners and an analog waveform data logger for embedded systems.

1 Review

Downloads: 0 This Week

Last Update: 2016-01-20
See Project
17

Auvai Text to Speech

Auvai is a Java API and Java Swing based application for Text to Speech conversion of Unicode Tamil. Future direction of this API and application is to support Text to Speech conversion for all "Indic" languages.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
18

Tools for Field Linguistics

This site is devoted to the collaborative creation of tools, protocols and procedures for field linguistics and language analysis. We are especially interested in tools for annotating or manipulating text, audio and video-based language archives.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
19

Lucidium Application Platform

Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.

Downloads: 0 This Week

Last Update: 2013-04-08
See Project
20

Easy access to MS Speech API 4 and 5

The Open Source [GNU GPL} library writed in Delphi, who provide easy access to MS Speech API (SAPI4 and SAPI5 like one) COM interface. The source code have sample to call it library for Delphi, Assembler, C#, C, Lasarus and FreeBasic.

1 Review

Downloads: 3 This Week

Last Update: 2015-08-04
See Project
21

MRCP4J

The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.

Downloads: 0 This Week

Last Update: 2013-04-25
See Project
22

AIBO Pal

A speech recognition application. It uses Microsoft Speech SDK to recognize and speak words. It can Play Music, Read the News, Tell the Time, Open Apps and many other cool things only with voice commands.

Downloads: 1 This Week

Last Update: 2015-05-22
See Project
23

Civil Defence

(rus) Civil Defence - цель проекта реализовать комплексное решение для организации библиотек для незрячих людей. (eng) Civil Defence - library for blind.

Downloads: 0 This Week

Last Update: 2015-11-11
See Project
24

Distributed Artificial Intelligence

DAI = Distributed Artificial Intelligence The projected is intended to be a test bed for AI related concepts and technologies, not necessarily an end user product, though that could change. Some of the modules can be modified for other uses.

Downloads: 0 This Week

Last Update: 2016-03-15
See Project
25

SR Media Player

The Speech Recognition Media Player is designed to browse and play your music and videos only with your voice. Plug in a remote microphone to your PC and use it as a Remote Control. Really helpful for the visually handicapped.

Downloads: 0 This Week

Last Update: 2013-04-24
See Project