Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "audio source separation"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 27
Windows 26
Mac 25
More...
BSD 15
ChromeOS 14
Desktop Operating Systems 2

Category

Artificial Intelligence 28
Multimedia 9
Internet 3
Software Development 3
Business 2
Games 1
Productivity 1
Scientific/Engineering 1
Security 1

License

OSI-Approved Open Source 27
Other License 1

Translations

English 5
Chinese (Simplified) 2
Swedish 1

Programming Language

JavaScript 28
Java 4
TypeScript 4
C 3
C++ 3
More...
Python 3
C# 2
PHP 2
BASIC 1
Elixir 1
MATLAB 1
Perl 1
Unix Shell 1

Status

Beta 3
Alpha 2
Production/Stable 2
Planning 1
More...
Pre-Alpha 1

Showing 28 open source projects for "audio source separation"

View related business solutions

Artificial Intelligence JavaScript Clear Filters & Widen Search

Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
Try Google Cloud Risk-Free With $300 in Credit
No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.

Start Free
1

StableSwarmUI

Multi-user UI for managing and running Stable Diffusion workflows tool

StableSwarmUI is a web-based interface designed to manage and coordinate Stable Diffusion image generation workflows in a multi-user environment. It focuses on enabling multiple users to interact with shared resources, making it suitable for collaborative or server-based deployments. It provides a centralized system where users can submit, monitor, and manage generation tasks through a browser interface. It abstracts much of the complexity involved in running diffusion models by offering a...

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
2

Markdownify MCP Server

Convert files and web content into clean, usable Markdown easily

Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows. Developers can install dependencies, build, and run the server...

Downloads: 3 This Week

Last Update: 10 hours ago
See Project
3

Buster

Captcha solver extension for humans

Save time by asking Buster to solve captchas for you. Buster is a Firefox extension which helps you to solve difficult captchas by completing reCAPTCHA audio challenges using speech recognition. Challenges are solved by clicking on the extension button at the bottom of the reCAPTCHA widget. It is not guaranteed that challenges are always solved, the limitations of the technology need to be considered. The continued development of Buster is made possible thanks to the support of awesome...

Downloads: 19 This Week

Last Update: 2024-06-04
See Project
4

AI-Media2Doc

AI tool converting video/audio into structured documents instantly

AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not...

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

Agili Hacker Podcast

AI tool that turns Hacker News posts into daily podcast updates

...Built with modern web technologies, the project focuses on automation, speed, and accessibility. It supports continuous updates, allowing listeners to receive fresh insights daily. As an open-source tool, it also encourages community contributions and customization for developers who want to adapt or extend its workflow for similar AI-driven content pipelines.

Downloads: 2 This Week

Last Update: 10 hours ago
See Project
6

comfyui-mixlab-nodes

Workflow and speech recognition app

comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that...

Downloads: 4 This Week

Last Update: 2025-11-28
See Project
7

WanGP

AI video generator optimized for low VRAM and older GPUs use

Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and certain AMD GPUs. ...

Downloads: 11 This Week

Last Update: 1 day ago
See Project
8

annyang!

Speech recognition for your site

annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
9

FastRTC

The python library for real-time communication

FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat,...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

xgplayer

A HTML5 video player with a parser that saves traffic

xgplayer is a web-friendly, open-source media player library maintained by ByteDance, designed for playing audio/video streams in browsers or web applications with robust control, flexibility, and extensibility. It abstracts many of the lower-level complexities of HTML5 media, providing a consistent API for playback control, custom UI overlays, adaptive streaming, plugin hooks, and cross-browser compatibility.

Downloads: 0 This Week

Last Update: 7 days ago
See Project
11

Deep Chat

Customizable AI chat component for websites with API support

Deep Chat is a highly customizable web component designed to simplify the integration of AI-powered chat interfaces into websites. It allows developers to embed a fully functional chatbot using minimal setup, while still offering extensive control over behavior, appearance, and integrations. Deep Chat supports connections to a wide range of AI services as well as custom backends, enabling flexible deployment for different use cases. It is built as a framework-agnostic solution, meaning it...

Downloads: 3 This Week

Last Update: 2026-03-18
See Project
12

Venom

Venom is the most complete javascript library for Whatsapp

Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, support for creating any interaction, such as customer service, media sending, sentence recognition based on artificial intelligence and all types of design architecture for WhatsApp. It's a high-performance alternative API to whatzapp, you can send, text messages, files, images, videos and more. Remember, the API was developed on a platform called RESTful Web services, providing interoperability...

Downloads: 0 This Week

Last Update: 2024-11-15
See Project
13

Node.js Telegram Bot API

Telegram Bot API for NodeJS

TelegramBot is an EventEmitter that emits several events. Message, received a new incoming Message of any kind. Depending on the properties of the Message, one of these events may ALSO be emitted, text, audio, document, photo, sticker, video, voice, contact, location, new_chat_members, left_chat_member, new_chat_title, new_chat_photo, delete_chat_photo, group_chat_created, game, pinned_message, poll, dice, migrate_from_chat_id, migrate_to_chat_id, channel_chat_created,...

Downloads: 0 This Week

Last Update: 2025-12-13
See Project
14

eGuideDog free software for the blind

eGuideDog project develops free software for the blind. Currently, we focus on WebSpeech, Ekho TTS and WebAnywhere.

16 Reviews

Downloads: 193 This Week

Last Update: 2 days ago
See Project
15

MyBox

Easy Tools of PDF, Image, File, Network, Data, and Medias

javafx-desktop-apps pdf image ocr icc barcode color-palette text bytes markdown html archive compress digest video audio editor converter media https://github.com/Mararsh/MyBox Self-contain packages need not java env nor installation. Jar packages need Java 16 or higher.

Downloads: 0 This Week

Last Update: 2026-02-10
See Project
16

ChatGPT UI

A ChatGPT web client that supports multiple users, and databases

A ChatGPT web client that supports multiple users, multiple database connections for persistent data storage, supports i18n. Provides Docker images and quick deployment scripts. Support gpt-4 model. You can select the model in the "Model Parameters" of the front-end. The GPT-4 model requires whitelist access from OpenAI. Added web search capability to generate more relevant and up-to-date answers from ChatGPT! This feature is off by default, you can turn it on in `Chat->Settings` in the...

Downloads: 2 This Week

Last Update: 2023-09-07
See Project
17

OpenAI Web Application

A web application that allows users to interact with OpenAI's models

A web application that allows users to interact with OpenAI's modles through a simple and user-friendly interface. This app is for demo purpose to test OpenAI API and may contain issues/bugs. User-friendly interface for making requests to the OpenAI API. Responses are displayed in a chat-like format. Select Models (Davinci, Codex, DALL·E, Whisper) based on your needs. Create AI Images (DALL·E). Audio-Text Transcribe (Whisper). Highlight code syntax. Type in the input field and press enter or...

Downloads: 0 This Week

Last Update: 2023-03-23
See Project
18

ml5.js

Friendly machine learning for the web

A neighborly approach to creating and exploring artificial intelligence in the browser. ml5.js aims to make machine learning approachable for a broad audience of artists, creative coders, and students. The library provides access to machine learning algorithms and models in the browser, building on top of TensorFlow.js with no other external dependencies.

Downloads: 1 This Week

Last Update: 2024-08-01
See Project
19

Read Aloud

An awesome browser extension that reads aloud webpage content

Read Aloud is a browser extension for Chrome, Firefox, and other Chromium-based browsers that converts webpage text to audio using text-to-speech technology. It is designed to work on a wide variety of sites, including news, blogs, online textbooks, course materials, fanfiction, and more. The extension targets users who prefer listening over reading, as well as people with dyslexia, other learning disabilities, or eye strain, and children learning to read. Read Aloud lets users choose from...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
20

XZVoice

Free and open source text-to-speech software

Text-to-speech software developed by Electron + vue + ElementUI + js. The high-fidelity and flexible configuration of speech synthesis products opens up the closed loop of human-computer interaction and enables applications to sound realistically. A variety of timbres are available, and functions such as adjusting speech rate, intonation, and volume are provided. Technically, multi-level rhythmic pauses are taken into account to achieve the purpose of natural synthesizing rhythm, and...

Downloads: 5 This Week

Last Update: 2022-10-04
See Project
21

Universal Data Tool

Collaborate & label any type of data, images, text, or documents etc.

An open-source tool and library for creating and labeling datasets of images, audio, text, documents and video in an open data format. The Universal Data Tool can be used by anyone on your team, no data or programming skills needed. Simplicity without sacrificing any powerful developer features and integrations. Use the Universal Data Tool directly from a web browser or with a Windows, Mac or Linux desktop application.

Downloads: 0 This Week

Last Update: 2022-08-11
See Project
22

Marinara

Pomodoro® time management assistant for Chrome

...Marinara is not affiliated or associated with or endorsed by Pomodoro®, The Pomodoro Technique®, or Francesco Cirillo. Configurable timer durations. Desktop & tab notifications. Audio notifications with over 20 sounds. Ticking timer sounds. Scheduled automatic timers. Open-source software. Currently, Marinara is configured for developers working and packaging releases on Mac OS. Support for Linux or Windows is welcome. Marinara uses the system ruby and makes tools to build releases. This will produce a packaged extension ready for uploading to the Chrome Web Store in the root directory of the project.

Downloads: 3 This Week

Last Update: 2022-05-04
See Project
23

chatbot_chung

chatbot chung is a keywords based probabilities algorythm simple entertainment chatbot with 3D talking openGL avatars written in freebasic. Can import aiml simple question/answer or question/random/answers or single star/ multi srai data saved from "AIML_chung" open source application . Online html5 javascript version with 44 languages multilingual auto detection available on the website (source included in the zip file). SORT gentext text generation algorythm option added (desktop version) .

Downloads: 0 This Week

Last Update: 2020-06-27
See Project
24

Text to Voice

'Text to Voice' or 'Text to Speech' is 1 of the coolest Firefox add-ons. It gives ur brwsr the pwr of speech. Select txt, clck the bttn on the bttm rite & this add-on spks the selectd txt 4 u. Isn't it brllant? Moreovr odio file cn b dnloaded as

Downloads: 4 This Week

Last Update: 2014-04-25
See Project
25

Speedy Composer

Speedy Composer – Artificial Neural Network Melody Composer.

Thank you for your interest in Speedy Composer. Speedy Composer is an automated application for composing melodies for Speedy Net members. We recently made changes to the source code of Speedy Net, and converted it into the Python language and Django framework. Since Speedy Composer was originally written in PHP, it is not adapted to work with Speedy Net in its current form. So unfortunately we were forced to temporarily close the app Speedy Composer. But don't worry, we kept backups of all...

Downloads: 0 This Week

Last Update: 2020-09-24
See Project

Previous
You're on page 1
2
Next

Related Searches

ekho

ekho tts

captcha

convertarabicaudiototext

whatsapp for kali linux

telegram bot

merge video

php admin panel

web application

text to speech 2023

Related Categories

Artificial Intelligence

Multimedia

Internet

Software Development

Business

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise