Search Results for "audio source separation" - Page 2

Showing 363 open source projects for "audio source separation"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    comfyui-mixlab-nodes

    comfyui-mixlab-nodes

    Workflow and speech recognition app

    comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Music-bot

    Music-bot

    A complete code to download for a cool Discord music bot

    Music-bot is a Discord bot designed to stream and manage music playback within voice channels, providing users with an interactive audio experience. It supports playing music from various online sources, including streaming platforms and direct URLs. The bot includes queue management features that allow users to add, remove, and reorder tracks during playback. It integrates audio processing tools to ensure smooth streaming and consistent playback quality. Music-bot also supports commands for...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    JsAction

    JsAction

    JsAction is a small event delegation library

    JSAction is a JavaScript framework developed by Google that provides a structured, event-driven architecture for managing user interactions in large-scale web applications. It simplifies event handling by declaratively binding actions to DOM elements through HTML attributes, enabling clean separation between markup and behavior. JSAction helps improve performance, maintainability, and reliability by minimizing the use of inline scripts and global event listeners. It is especially useful in...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    MediaElement.js

    MediaElement.js

    A dependable HTML media framework

    Boost your player with Chromecast, Google Analytics and more. MediaElement.js is a blazingly fast and amazingly powerful HTML5 audio and video library that creates a unified feel for media files (MP4, MP3, FLV), streaming content (HLS, M(PEG)-DASH, RTMP), and embeddable players like YouTube, Vimeo, Twitch, DailyMotion, Facebook, and SoundCloud. MediaElement.js is a set of custom Flash plugins that mimic the HTML5 MediaElement API for browsers that don't support HTML5 or don't support the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    FastRTC

    FastRTC

    The python library for real-time communication

    FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Web Scrobbler

    Web Scrobbler

    Scrobble music all around the web!

    Web Scrobbler helps music listeners to scrobble their online playback history. Web Scrobbler is a browser extension created for people who listen to music online through their browser, and would like to keep an updated playback history using scrobbling services, such as Last.fm, Libre.fm and ListenBrainz. Download and install the extension for your browser. You can use the download buttons above. Open the extension options, and expand the "Accounts" section, then sign in to a scrobbling...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Markdownify MCP Server

    Markdownify MCP Server

    Convert files and web content into clean, usable Markdown easily

    Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows. Developers can install dependencies, build, and run the server...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AI-Media2Doc

    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Tagify

    Tagify

    Lightweight, efficient Tags input component in Vanilla JS

    Transforms an input field or a textarea into a Tags component, in an easy, customizable way, with great performance and a small code footprint, exploded with features. Customizable HTML templates for the different areas of the component (wrapper, tags, dropdown, dropdown item, dropdown header, dropdown footer) Shows suggestions list (flexible settings & styling) at full (component) width or next to the typed texted (caret) Allows setting suggestions' aliases for easier fuzzy-searching....
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    StableSwarmUI

    StableSwarmUI

    Multi-user UI for managing and running Stable Diffusion workflows tool

    StableSwarmUI is a web-based interface designed to manage and coordinate Stable Diffusion image generation workflows in a multi-user environment. It focuses on enabling multiple users to interact with shared resources, making it suitable for collaborative or server-based deployments. It provides a centralized system where users can submit, monitor, and manage generation tasks through a browser interface. It abstracts much of the complexity involved in running diffusion models by offering a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Phaser HTML5 Game Framework

    Phaser HTML5 Game Framework

    Phaser is a free and fast 2D game framework for making HTML5 games

    Phaser is a popular open-source 2D game framework for making HTML5 games for desktop and mobile platforms. Built with JavaScript and powered by WebGL and Canvas, it offers a robust API for developing everything from arcade to platformer and puzzle games.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Amazon Connect connect-rtc-js

    Amazon Connect connect-rtc-js

    Provide softphone support to AmazonConnect customers

    connect-rtc.js provides softphone support to AmazonConnect customers when they choose to directly integrate with AmazonConnect API and not use the AmazonConnect web application. It implements Amazon Connect WebRTC signaling protocol and integrates with browser WebRTC APIs to provide a simple contact session interface that can seamlessly integrate with Amazon Connect StreamJS. In a typical amazon-connect-streams integration, connect-rtc-js is not required on parent page. Softphone call...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Nextcloud Talk

    Nextcloud Talk

    Video- & audio-conferencing app for Nextcloud

    Nextcloud Talk is the official chat, video and audio conferencing app for Nextcloud that allows users to chat, call and screenshare with multiple other users. Nextcloud offers better protection for your communication as it provides end-to-end encryption and keeps even metadata from leaking. You can have private, group, public or password protected calls by simply inviting one person, a whole group, or sending a public link as an invitation to a call. It is also conveniently integrated with...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    ThumbmarkJS

    ThumbmarkJS

    World's best free browser fingerprinting library

    ThumbmarkJS is an MIT-licensed browser fingerprinting library that produces stable fingerprints with 90% uniqueness. It works with normal and private browsing. ThumbmarkJS is a free, open‑source browser fingerprinting JavaScript library, designed as an alternative to FingerprintJS. It generates distinct, persistent device fingerprints using web APIs like canvas, audio, fonts, WebGL, and more, enabling identification of browsers across sessions, even in incognito or cleared-cache scenarios. It supports both client-side-only installs via CDN and npm, with optional API integration for improved uniqueness.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    mongo-express

    mongo-express

    Web-based MongoDB admin interface, written with Node.js

    A web-based MongoDB admin interface written with Node.js, Express, and Bootstrap 5.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    PlayCanvas Engine

    PlayCanvas Engine

    Fast and lightweight JavaScript game engine built on WebGL and glTF

    PlayCanvas is an open-source game engine. It uses HTML5 and WebGL to run games and other interactive 3D content in any mobile or desktop browser. PlayCanvas is used by leading companies in video games, advertising and visualization such as Animech, Arm, BMW, Disney, Facebook, Famobi, Funday Factory, IGT, King, Miniclip, Leapfrog, Mojiworks, Mozilla, Nickelodeon, Nordeus, NOWWA, PikPok, PlaySide Studios, Polaris, Product Madness, Samsung, Snap, Spry Fox, Zeptolab, Zynga. The PlayCanvas Engine...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 18
    xgplayer

    xgplayer

    A HTML5 video player with a parser that saves traffic

    xgplayer is a web-friendly, open-source media player library maintained by ByteDance, designed for playing audio/video streams in browsers or web applications with robust control, flexibility, and extensibility. It abstracts many of the lower-level complexities of HTML5 media, providing a consistent API for playback control, custom UI overlays, adaptive streaming, plugin hooks, and cross-browser compatibility.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Screenity

    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    Screenity is a feature-packed screen and camera recorder for Chrome. Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 20
    Etherpad

    Etherpad

    A real-time collaborative document editor for the web

    Etherpad is a highly customizable online document editor that allows for collaborative editing in real-time by up to thousands of real-time users. With Etherpad, you don’t have to send documents back and forth-- simply set it up, share the link and collaborate with co-workers, fellow students, or friends on just about any written document! Etherpad provides all-access to data through a well-documented API and provides support for data export/import capabilities. It’s got an awesome set of...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Jitsi Meet

    Jitsi Meet

    Secure, simple and scalable video conferences application

    ...In fact, Jitsi Meet sounds better, thanks to HD audio with Opus; is anonymous, no need for an account, ever! It also keeps conversations private with encryption by default (and advanced security settings) and is developer-friendly, modify and customize it to your heart’s content. It’s 100% open source. Trade messages and emojis while you video conference, with integrated chat.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 22
    AriaNg Native

    AriaNg Native

    A better aria2 desktop frontend than AriaNg, containing all features

    AriaNg is a modern web frontend making aria2 easier to use. AriaNg is written in pure HTML & javascript, thus it does not need any compilers or runtime environment. You can just put AriaNg in your web server and open it in your browser. AriaNg uses a responsive layout and supports any desktop or mobile device. Pure Html & Javascript, no runtime required. Responsive design, supporting desktop and mobile devices. User-friendly interface. Sort tasks (by name, size, progress, remaining time,...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23
    DocsGPT

    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Rocket.Chat

    Rocket.Chat

    The communications platform that puts data protection first

    ...Talk to colleagues, partners, vendors, website visitors, or customers, regardless of the channel they choose to connect with you. Become a part of the largest open-source communications community in the world. Collaborate with millions of users and developers who are improving the way the world communicates.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    AMP

    AMP

    Web component framework for building ads, emails, websites and more

    AMP is an open source web component framework that allows you to easily create user-first websites, ads, emails, stories and more. AMP creates fast, smooth-loading web pages that prioritize the user-experience, consistently providing a fast experience across all devices and platforms.
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB