Showing 757 open source projects for "sound"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 8 Monitoring Tools in One APM. Install in 5 Minutes. Icon
    8 Monitoring Tools in One APM. Install in 5 Minutes.

    Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

    AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.
    Start Free
  • 1
    The Arcade Library

    The Arcade Library

    Easy to use Python library for creating 2D arcade games

    Arcade is an easy-to-use Python library for creating 2D video games. It provides a modern and straightforward API, enabling developers to craft engaging games and graphical applications efficiently. Arcade supports rendering shapes, handling user input, and managing game physics, making it suitable for both beginners and experienced developers.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    sound_source_id

    sound_source_id is a Python program for sound localization experiments

    sound_source_id is a Python program for sound localization experiments. Further info can be found on the project webpage: https://samcarcagno.altervista.org/sound_source_id/sound_source_id.html
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    AI Chatbot Framework

    AI Chatbot Framework

    Python chatbot framework with Natural Language Understanding

    Building a chatbot can sound daunting, but it’s totally doable. AI Chatbot Framework is an AI powered conversational dialog interface built in Python. With this tool, it’s easy to create Natural Language conversational scenarios with no coding efforts whatsoever. The smooth UI makes it effortless to create and train conversations to the bot and it continuously gets smarter as it learns from conversations it has with people.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 5
    Auto-Deep-Research

    Auto-Deep-Research

    Your Fully-Automated Personal AI Assistant

    ...Auto-Deep-Research integrates retrieval from academic and web sources, processes document corpora for relevance and key insights, and organizes outputs into coherent chapters or sections according to research standards. It also embeds validation loops, where intermediate drafts are self-checked for consistency, coverage, and alignment with sound reasoning practices, reducing reliance on raw generation alone.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    SenseVoice

    SenseVoice

    Multilingual speech recognition and audio understanding model

    ...It includes different model variants optimized for either speed or accuracy, allowing developers to choose a configuration suitable for their use case. In addition to speech transcription, SenseVoice can detect emotional cues in speech and identify common sound events such as applause, laughter, or coughing. It also provides tools for running inference, exporting models to formats like ONNX or LibTorch, and deploying the system through APIs.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    OSWorld

    OSWorld

    Benchmarking Multimodal Agents for Open-Ended Tasks

    OSWorld is an open-source synthetic world environment designed for embodied AI research and multi-agent learning. It provides a richly simulated 3D world where multiple agents can interact, perform tasks, and learn complex behaviors. OSWorld emphasizes multi-modal interaction, enabling agents to process visual, auditory, and symbolic data for grounded learning in a simulated world.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    pyAudioAnalysis

    pyAudioAnalysis

    Python Audio Analysis Library: Feature Extraction, Classification

    ...The library supports multiple audio processing workflows, including feature extraction from raw audio signals, training of machine learning models, and automatic audio segmentation. It also includes utilities for visualizing audio features and analyzing patterns within sound recordings, which can be useful in applications such as speech recognition, music classification, and acoustic event detection. Because the library integrates machine learning algorithms with signal processing tools, it enables researchers to develop complete audio analysis pipelines using a single framework.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    SleepFM-Clinical

    SleepFM-Clinical

    Improve human sleep through scientifically

    ...Rather than simply playing static white noise or ambient tracks, it uses a closed-loop, frequency-modulated framework that responds to user-specific sleep patterns and physiological signals to tailor sound in ways that can enhance sleep onset and depth. The clinical release includes additional features for controlled experimentation, such as logging capabilities, adjustable parameter sets, and protocols suitable for sleep studies and therapeutic settings. It also integrates tools for clinicians to configure sessions, annotate events, and potentially link with biofeedback data, enabling a more nuanced understanding of sound’s effect on sleep architecture over time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    ImageBind

    ImageBind

    ImageBind One Embedding Space to Bind Them All

    ...Instead of aligning each pair independently, ImageBind uses image data as the central binding modality, aligning all other modalities to it so they can interoperate zero-shot. This creates a unified embedding space where representations from any modality can be compared or retrieved against any other (e.g., matching sound to text or depth to image). The model is trained using large-scale contrastive learning, leveraging diverse datasets from natural images, videos, audio clips, and sensor data. Once trained, it can perform cross-modal retrieval, zero-shot classification, and multimodal composition without additional fine-tuning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TerraGov Marine Corps

    TerraGov Marine Corps

    TGMC: TerraGov Marine Corps, a SS13 mod

    TerraGov Marine Corps (TGMC) is an open source multiplayer game built on the BYOND engine, forked from the Space Station 13 (SS13) codebase. It is a tactical, role-playing game that pits groups of human marines against alien forces in large-scale, cooperative and competitive scenarios. The project focuses heavily on teamwork, coordination, and immersive gameplay, providing players with different roles such as engineers, medics, or combat marines to ensure strategic variety. TGMC offers a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    OpenaiBot

    OpenaiBot

    Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant

    If you don't have the instant messaging platform you need or you want to develop a new application, you are welcome to contribute to this repository. You can develop a new Controller by using Event.py. Compatibility with multiple LLMs and integration with GPT and third-party systems is handled by our llm-kira project on GitHub. It can accurately limit billing, with limits and ID binding. Supports asynchronous operations and can handle multiple requests simultaneously. Allows for private and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...
    Downloads: 32 This Week
    Last Update:
    See Project
  • 14
    Qwen-Audio

    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    Qwen-Audio is a large audio-language model developed by Alibaba Cloud, built to accept various types of audio input (speech, natural sounds, music, singing) along with text input, and output text. There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio. It uses multi-task training over many different audio tasks (30+), and achieves strong multi-benchmarks performance...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    HuMidi

    HuMidi

    Play MIDI like a human in ROBLOX with automatic sustain pedals.

    HuMidi is a universal piano auto player for ROBLOX. It plays even the most generic MIDI file with great depth, thanks to the humanization and automatic sustain pedal generation algorithms. The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance that no other MIDI players could! Personally have been tested in: - Visual Pianos - Starving Pianists - Digital Piano - Piano's Got Talent This tool is universal, as long as the piano in-game...
    Leader badge
    Downloads: 335 This Week
    Last Update:
    See Project
  • 16
    EasyABC

    EasyABC

    EasyABC is an open source ABC editor

    EasyABC allows the user to create, edit, view, play, convert music written in the ABC music notation language. The program was originally written in Python 2.7 and WxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends upon other external programs like abc2midi, abcm2ps, fluidsynth. If you install the Windows or Mac executables most of these programs...
    Leader badge
    Downloads: 244 This Week
    Last Update:
    See Project
  • 17
    SonicDive-8D-Music-Player

    SonicDive-8D-Music-Player

    SonicDive 8D Music Player v-1.0

    SonicDive is an immersive audio visualization & effects-based music player designed to deliver a next-level listening experience. It combines dynamic spectrums with advanced spatial audio effects like 3D & 8D sound. ✨ Features 🎵 Audio Visual Spectrums SonicDive supports multiple real-time audio visualizations: 💿 Disk Spectrum 📊 Bars Spectrum 🌊 Wave Spectrum 🖼️ Thumbnail Spectrum ⭕ Circle Spectrum Each spectrum reacts dynamically to the music’s frequency and intensity. 🎚️ Audio Effects & Modes Choose from a variety of sound profiles to match your mood: 🔊 Flat 🎧 3D Audio 🎧 8D Audio 🎤 Hip-Hop 🎻 Classic 🎸 Rock 🎥 Dolby Effect
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Media Downloader

    Media Downloader

    A simple tool to download video and audio form internet sources

    This software enables users to download video and audio content from a variety of supported websites for personal and educational use. It is free and open source, promoting transparency, flexibility, and community-driven development.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 19
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder. All processing is done locally on your device, no uploads, no accounts, and no telemetry. The app runs on CPU by...
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 20
    OpenPiano — Virtual Piano for Windows

    OpenPiano — Virtual Piano for Windows

    Desktop piano playable with a PC keyboard, mouse, or MIDI device.

    OpenPiano is a Windows desktop piano application that allows you to play, practice, and record music using your PC keyboard, mouse, or a MIDI device. It supports real-time playback using SoundFonts and provides on-screen piano layouts for visual feedback while playing. OpenPiano is designed to run entirely locally. It does not require accounts, cloud services, or an internet connection for core functionality. Project links: Website:...
    Leader badge
    Downloads: 133 This Week
    Last Update:
    See Project
  • 21
    SPPAS

    SPPAS

    SPPAS - the automatic annotation and analyses of speech

    ...Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... <https://sppas.org>
    Downloads: 17 This Week
    Last Update:
    See Project
  • 22
    Random-jukebox

    Random-jukebox

    Project to hear some random music play

    Downloads: 4 This Week
    Last Update:
    See Project
  • 23

    MP3FileStructureAnalyser

    Analyse a MP3 file structure (ID3 and frame headers) into JSON objects

    The project contains a small command line tool (written in Python) that analyses the file structure (ID3 and frame headers) of MP3 files and projects the structure into a JSON object.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    PyTube Downloader

    PyTube Downloader

    Let's quickly download YouTube videos & playlists with one click.

    PyTube Downloader lets you quickly download YouTube videos and playlists with one click. Choose from 144p to 8K quality and download multiple videos simultaneously. PyTube Downloader 让你轻松一键下载 YouTube 视频和播放列表。支持选择 144p 到 8K 的质量,并支持同时下载多个视频。
    Downloads: 134 This Week
    Last Update:
    See Project
  • 25
    UMD

    UMD

    Universal Multimedia Downloader, URL To Video/Audio. (UMD)

    ⚠️ PROJECT DEPRECATED ⚠️ Universal Media Downloader (UMD) is no longer maintained. This project has been officially replaced by MediaCrate, a better, actively maintained successor with improved performance, stability, and features. ➡ Download MediaCrate here: https://sourceforge.net/projects/mediacrate
    Downloads: 10 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB