Showing 121 open source projects for "sound recognition library"

  • 1
    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details).
    Downloads: 16 This Week
  • 2
    spotDL

    Download your Spotify playlists and songs along with album art

    spotDL is a command-line tool that allows users to download songs and playlists from Spotify by sourcing the audio from YouTube. Built in Python, it automatically matches Spotify tracks with corresponding videos on YouTube and downloads them with embedded metadata. The tool retrieves important information such as album art, song titles, artist names, and lyrics to organize downloaded files. spotDL is designed to be fast, accurate, and easy to use through a simple command-line interface. It...
    Downloads: 102 This Week
  • 3
    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the...
    Downloads: 13 This Week
  • 4
    AudioCraft

    AudioCraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. ...
    Downloads: 20 This Week
  • 5
    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 2 This Week
  • 6
    Audiomentations

    A Python library for audio data augmentation

    ...Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where background noise is present. A folder of (background noise) sounds to be mixed in must be specified. These sounds should ideally be at least as long as the input sounds to be transformed. Otherwise, the background sound will be repeated, which may sound unnatural.
    Downloads: 0 This Week
  • 7
    scikit-image

    Image processing in Python

    scikit-image is a collection of algorithms for image processing. It is available free of charge and free of restriction. We pride ourselves on high-quality, peer-reviewed code, written by an active community of volunteers. scikit-image builds on scipy.ndimage to provide a versatile set of image processing routines in Python. This library is developed by its community, and contributions are most welcome! Read about our mission, vision, and values and how we govern the project. Major proposals...
    Downloads: 2 This Week
  • 8
    Swing Music

    Swing Music is a beautiful, self-hosted music player

    Swing Music is a beautiful, self-hosted music player and streaming server that lets you bring your personal audio library online with a modern browser-based interface, giving you a private alternative to mainstream streaming services. Designed to be both elegant and powerful, the project scans your local music files (like MP3s or FLACs), organizes metadata, and streams them on-demand to any device with a browser or its Android client. It includes features like folder browsing, playlist...
    Downloads: 3 This Week
  • 9
    Tauon

    The music player of today

    Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, ListenBrainz, and Maloja scrobbling. MPRIS2 support for desktop integration. Tauon is a...
    Downloads: 5 This Week
  • 10
    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including...
    Downloads: 12 This Week
  • 11
    Free Karaoke File Maker

    Free Karaoke File Maker

    Hides the singer's voice in music files that offer no built-in way to mute the vocals on the computer. By default, the output is saved with 2 audio tracks (singer + melody). To save only the melody without the singer's voice, select the No Vocal option. To choose where the output file is saved, click Save Folder and pick a location (default: Desktop). Once these settings are in place, you can change the file you want to change by holding down the mouse and...
    Downloads: 8 This Week
  • 12
    The Sound Description Interchange Format (SDIF) is an established standard for the interchange of sound descriptions and analysis data. This project provides libraries, SDIF (in C) and Easdif (in C++), tools, and wrappers to read and write SDIF files.
    Downloads: 0 This Week
  • 13
    PC Musique Player

    Music player for Windows with rich features and themes

    PC Musique Player has 23 different themes to change the player background and UI control background. It has (Play/Pause/Resume) in a single button, Previous, Next, Select library folder, shuffle, loop button(loop ON/loop OFF/loop All), stop, volume increase, volume decrease, mute. Choose any folder to play the audio files from the folder as library. To play audio files in the library listed towards the right side of the app, double click on it. Also use the <space> bar in your keyboard to...
    Downloads: 0 This Week
  • 14
    AudiooPy

    Audio manager in Python Object-Oriented Programming

    AudiooPy stands for "Audio Manager in Python Object-Oriented Programming." This library provides a range of useful operations for sound files and audio fragments. It processes audio at the frame level, working with signed integer samples of 8, 16, or 32 bits, stored in byte-like objects. Key features include: - Reading and writing WAV files using Python's standard library. - A scientifically validated method for automatically detecting sound segments in speech...
    Downloads: 0 This Week
  • 15
    Midi_Master_Clock

    A simple command-line MIDI master clock

    MIDI Master Clock: Simple MIDI Song and Control Change Manager. MIDI Master Clock is a lightweight command-line utility for effortless MIDI device control and synchronization. It lets you define songs or cues in a simple text file, each with its own BPM and custom MIDI Control Change (CC) messages. What it does: Reads song data: configure song titles, BPM, and MIDI CCs (e.g., CC_ch0 10) in an easy-to-read data.txt file. Interactive control: select songs from a list, advance through...
    Downloads: 0 This Week
  • 16
    LAME (Lame Aint an MP3 Encoder)

    A high quality MP3 encoder

    LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psycho acoustics, quality and speed of MP3 encoding. Note: we provide source code only!
    Downloads: 20,797 This Week
  • 17
    Pyst consists of a set of interfaces and libraries for programming Asterisk from Python. The library currently supports AGI, AMI, and the parsing of Asterisk configuration files. It also includes debugging facilities for AGI. 2014-04-17: Moved version control to Git. To check out the code, see the "Code" tab. Note that the whole history, including ancient CVS, then some time in monotone, then Subversion, was united into one Git repository thanks to ESR's...
    Downloads: 0 This Week
  • 18
    Spleeter

    Deezer source separation library including pretrained models

    Spleeter is the Deezer source separation library, written in Python using TensorFlow and shipped with pretrained models. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides pretrained state-of-the-art models for performing various flavours of separation. The 2-stem and 4-stem models achieve state-of-the-art performance on the musdb dataset. Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x...
    Downloads: 51 This Week
  • 19
    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian), GPU/CPU encode/decode/post-processing, stream statistics, CCTV, adaptive HLS streams, load balancing, temporary URLs, HLS push, EPG scanning, subtitles-to-text conversions, ad insertion, logo overlay, video effects, relays, timeshifts, catchups, playlists, restream/transcode from online streaming services like YouTube, Twitch ...
    Downloads: 0 This Week
  • 20
    GUIDOLib
    GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available, as well as a JavaScript version of the library. A Web API has also been designed, allowing the engine to be deployed as a web service.
    Downloads: 10 This Week
  • 21
    Swami Project
    A SoundFont editor and other software for editing, managing and sharing sample based MIDI instrument files for computer music composition. Support for other formats is planned.
    Downloads: 11 This Week
  • 22
    AngelReader

    An E-book, Audio-book, & Library Loader in One Application

    AngelReader: A minimalist but powerful GUI application that has the capacity to load [1] E-books in plain text format with the least use of both software and hardware resources. It can also load [2] Audio-books with the basic functions of play, stop, pause, and resume with the same minimalist economy that doesn't hog computer resources. When used in integration with the AngelReader Library Selector, it can function as a mini library management system for books in electronic formats. It's in...
    Downloads: 0 This Week
  • 23
    FM2TXT

    Listens to radio via RTL-SDR, recognizes audio, and writes a text file log

    Log speech from your favorite FM station to a text file using an RTL-SDR dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the source on GitHub rather than SourceForge: https://github.com/randaller/fm2txt For those who want to recognize audio from a sound card instead of RTL-SDR (this also allows transcribing NFM, etc.): https://github.com/randaller/souncard2txt
    Downloads: 1 This Week
  • 24
    DreamStudio by Volocian Studios
    Volocian believes first and foremost that financial status shouldn’t be a barrier to entry in multimedia content creation. Whether you want to design a product or website, record a band, remix a song, or produce a feature length movie with professional post production graphics and effects, Volocian™ wants to help, even if you’re using borrowed hardware with no previous experience. We provide affordable products and services for any budget, including cost-free solutions for education and demo...
    Downloads: 18 This Week
  • 25
    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder, as well as training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 5 This Week