Showing 19 open source projects for "audio extract"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    FFsubsync

    FFsubsync

    Automagically synchronize subtitles with video

    ...In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.
    Downloads: 30 This Week
    Last Update:
    See Project
  • 2
    YouTube Music Downloader

    YouTube Music Downloader

    A simple app to get songs from YouTube in mp3 format with artist name

    YouTube Music Downloader is a command-line music downloader written in Python that retrieves audio from YouTube and enriches it with detailed metadata from external sources. It combines tools like yt-dlp and FFmpeg to extract high-quality audio while automatically tagging files with artist name, album, release date, and artwork. The application distinguishes itself by integrating metadata providers such as Spotify and iTunes, ensuring that downloaded tracks resemble properly organized music library entries. ...
    Downloads: 50 This Week
    Last Update:
    See Project
  • 3
    pyAudioAnalysis

    pyAudioAnalysis

    Python Audio Analysis Library: Feature Extraction, Classification

    pyAudioAnalysis is an open-source Python library designed for audio signal analysis, machine learning, and music information retrieval tasks. The project provides a collection of tools that allow developers to extract meaningful features from audio files and use those features for classification, segmentation, and analysis. The library supports multiple audio processing workflows, including feature extraction from raw audio signals, training of machine learning models, and automatic audio segmentation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 72 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    AudioMuse-AI

    AudioMuse-AI

    AudioMuse-AI is an Open Source Dockerized environment

    ...AudioMuse-AI integrates with several popular self-hosted music servers including Jellyfin, Navidrome, and Emby, allowing users to extend existing media servers with advanced AI-powered recommendation capabilities. The system uses machine learning and audio analysis tools such as Librosa and ONNX models to extract features directly from audio tracks.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    BlogWizard

    BlogWizard

    Generate blog articles from video or audio

    BlogWizard is a demo/utility project built on top of Groq’s LLM infrastructure that converts video or audio content into well-structured blog posts, enabling creators to repurpose multimedia content into text — useful for SEO, accessibility, or reaching audiences that prefer reading. The tool uses transcription (e.g. via Whisper) to extract text from audio/video, then runs an LLM-based generation pipeline to transform that content into coherent, readable blog-format posts — with sections, formatting, and possibly metadata. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ChatTTS webUI & API

    ChatTTS webUI & API

    A simple native web interface that uses ChatTTS to synthesize text

    ...For convenience, there is a prepackaged Windows build: you download a release archive, extract it, and double-click app.exe to start the web UI, which opens on localhost:9966.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    Anything to NotebookLM

    Anything to NotebookLM

    Multi-source content processor for NotebookLM

    Qiaomu Anything to NotebookLM is a Claude Code skill that turns many types of source material into structured NotebookLM-ready outputs. It is built for users who want to convert articles, web pages, videos, PDFs, office files, podcasts, images, and search results into more usable study or presentation formats. The project uses natural-language commands, so the user can ask for a podcast, slide deck, mind map, report, quiz, flashcards, or infographic without manually building the workflow. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    OmniPull

    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 11
    MahaKurawa.My.ID MP4 VA Extract

    MahaKurawa.My.ID MP4 VA Extract

    MahaKurawa.My.ID MP4 VA Extract is a tool to extract mp4 file content

    MahaKurawa.My.ID MP4 VA Extract is a tool to extract MP4 file video and audio content. It also have ability to extract MKV file and single SSA Subtitle file. This software will not convert any video and audio file from MP4 file. This software just extract them as it is. This tool is made for that specific purpose. This tool "MahaKurawa.My.ID MP4 VA Extract v.1.0.3.1" can be obtained for free on https://www.mahakurawa.my.id.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MahaKurawa MP4V-A Extractor

    MahaKurawa MP4V-A Extractor

    This software is a tool to extract video and audio file that contained

    This software is a tool to extract video and audio file that contained by a .MP4 format. This software will not convert any video and audio file from yout .mp4 file. This software just extract them as it is. This tool is made for that specific purpose. This tool "MahaKurawa MP4 V-A Extractor V.10" can be obtained for free on https://www.mahakurawa.my.id.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    VoiceFixer

    VoiceFixer

    General Speech Restoration

    VoiceFixer is a machine-learning framework for “speech restoration”: given a degraded or distorted audio recording — with noise, clipping, low sampling rate, reverberation, or other artifacts — it attempts to recover high-fidelity, clean speech. The architecture works in two stages: first an analysis stage that tries to extract “clean” intermediate features from the noisy audio (e.g. removing noise, denoising, dereverberation, upsampling), and then a neural vocoder-based synthesis stage that reconstructs a high-quality waveform from those features. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    pytube

    pytube

    A lightweight, dependency-free Python library

    Pytube is a lightweight, dependency-free Python library that enables downloading YouTube videos and audio streams with minimal setup. It supports video resolution selection, progressive or adaptive streams, and caption downloads. Pytube is ideal for automation scripts, archiving tools, and media applications that need to interface with YouTube content programmatically.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Spatial Media

    Spatial Media

    Specifications and tools for 360º video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is present. ...
    Downloads: 60 This Week
    Last Update:
    See Project
  • 16
    uncaptcha

    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    uncaptcha is an open-source proof-of-concept system designed to demonstrate vulnerabilities in Google’s audio reCAPTCHA challenges by automatically solving them using speech recognition techniques. The project uses browser automation to navigate to CAPTCHA challenges, extract audio files, and process them through multiple speech-to-text services. By combining outputs from several transcription engines, the system increases the likelihood of correctly identifying the spoken digits or phrases required to solve the challenge. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    EnDiskEx

    Bulk extractor for Ensoniq-formatted disk images

    EnDiskEx is a command-line tool that bulk extracts instruments, sequences, songs, and banks from Ensoniq-formatted disk images (RAW, GKH, EDE, and EDA) for the EPS/ASR family of samplers. The extracted files are saved as EFE / SMF / TXT files. EnDiskEx is designed to extract Ensoniq banks for re-creation within a different DAW. It will track down the instrument and song files from bank references even if they were saved on another disk. There also exists a disk mapping feature to...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Pronac MediaMonkey Extension

    Pronac MediaMonkey Extension

    Recommends music based upon your current taste.

    A music recommendation engine. It is meant to be an add-on for popular media players like Winamp, Amarok, Rhythmbox or Banshee. Currently supports only MediaMonkey Player. Downlaod, extract and run "pronac.exe". Play the first song from the Now Playing list, it'll recommend you next songs from the same list. NOTE: MAKE SURE THAT SONG SHUFFLE IS TURNED OFF WHILE USING PRONAC. Based upon K-Nearest Neighbor Machine Learning Algorithm, K-Fold Cross Validation and EchoNest for audio features.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This is a wrapper for ffmpeg used to extract audio from flash videos encountered while running firefox. It's meant to provide a simple and most importantly quick way of ripping music from online flash videos. Read the desc in source for use notes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB