Showing 22 open source projects for "audio source separation"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    camofox-browser

    camofox-browser

    Headless browser automation server for AI agents to visit sites

    camofox-browser is a headless browser automation server built specifically for AI agents that need to interact with websites that often block standard automation stacks. It wraps Camoufox, a Firefox fork that performs fingerprint spoofing at the C++ level, which means many browser characteristics are altered before page scripts can inspect them, rather than relying on JavaScript-layer stealth patches. The project is designed around a REST API, making it easier for agents and external tools...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    notebooklm-py

    notebooklm-py

    Unofficial Python API and agentic skill for Google NotebookLM

    ...Its goal is to provide programmatic access not just to standard notebook operations, but also to many capabilities that are either limited or unavailable in the web interface, making it especially useful for automation and custom pipelines. The project covers notebook management, source ingestion, conversational querying, research workflows, and sharing controls, while also enabling the generation of a wide range of study and media artifacts. These outputs include audio overviews, videos, slide decks, infographics, quizzes, flashcards, reports, data tables, and mind maps, with configurable formats and export options.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 3
    Agentic Inbox

    Agentic Inbox

    A self-hosted email client with an AI agent, running entirely on Cloud

    Agentic Inbox is a self-hosted email client that integrates an AI agent directly into the inbox experience, enabling automated reading, organization, and drafting of emails. It runs entirely on Cloudflare Workers, using serverless infrastructure to manage incoming and outgoing messages without relying on external email services. Each mailbox is isolated with its own storage, ensuring data separation and security while maintaining performance. The system supports full email functionality,...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    Claude Code Video Vision

    Claude Code Video Vision

    Give Claude the ability to watch and understand videos

    Claude Video Vision is a plugin designed for Claude Code that enables large language models to process and understand video content by transforming it into multimodal inputs the model can reason over. Instead of attempting to directly interpret raw video streams, the system extracts key frames using tools like ffmpeg and processes audio through transcription engines, converting both visual and auditory signals into structured inputs for the model. The result is a perception layer that feeds...
    Downloads: 3 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 5
    Claude Code Plugins

    Claude Code Plugins

    Intelligent automation and multi-agent orchestration for Claude Code

    Claude Code Plugins is a lightweight framework designed to define, manage, and execute AI agents in a modular and extensible way, typically focusing on orchestrating tasks using large language models and tool integrations. The project provides abstractions for building agents that can interpret instructions, execute commands, and interact with external systems in a structured workflow. It emphasizes simplicity and composability, allowing developers to define agent behaviors through reusable...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    NanoClaw

    NanoClaw

    A lightweight alternative to Clawdbot / OpenClaw

    Nanoclaw is a lightweight, security-focused personal agent runtime designed as a slimmer alternative to larger “personal assistant” agent stacks, with an emphasis on being easy to audit and safe by default. It runs agent execution inside Apple containers to provide strong isolation boundaries, so individual chats and actions can be sandboxed with tighter filesystem and process separation than a typical single-process bot. The project connects directly to WhatsApp, letting you deploy an...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    clawhip

    clawhip

    claw + whip: Event-to-channel notification router

    Clawhip is an open-source daemon-first notification router designed to deliver structured events from development workflows directly to platforms like Discord and Slack. It acts as a central event-processing system that listens to sources such as Git, GitHub, tmux sessions, and custom CLI events, then routes them through a typed pipeline. Built with a clean separation between routing, rendering, and delivery, Clawhip ensures reliable and organized notifications without polluting AI agent contexts. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    E2B

    E2B

    Secure open source cloud runtime for AI apps & AI agents

    E2B's Code Interpreter SDK allows you to add code-interpreting capabilities to your AI apps. E2B Sandbox is a secure sandboxed cloud environment made for AI agents and AI apps. Sandboxes allow AI agents and apps to have long-running cloud secure environments. In these environments, large language models can use the same tools as humans do.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    OpenAI Python

    OpenAI Python

    The official Python library for the OpenAI API

    The OpenAI Python library provides convenient access to the OpenAI REST API from any Python 3.7+ application. The library includes type definitions for all request params and response fields, and offers both synchronous and asynchronous clients powered by httpx.
    Downloads: 9 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Agentex

    Agentex

    Open source codebase for Scale Agentex

    AgentEX is an open framework from Scale for building, running, and evaluating agentic workflows, with an emphasis on reproducibility and measurable outcomes rather than ad-hoc demos. It treats an “agent” as a composition of a policy (the LLM), tools, memory, and an execution runtime so you can test the whole loop, not just prompting. The repo focuses on structured experiments: standardized tasks, canonical tool interfaces, and logs that make it possible to compare models, prompts, and tool...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    video-use

    video-use

    Edit videos with Claude Code

    Video Use is an open-source AI-powered video editing tool that allows users to transform raw footage into polished videos using natural language commands. Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    OpenAI Realtime Agents

    OpenAI Realtime Agents

    This is a simple demonstration of more advanced, agentic patterns

    This repository demonstrates how to build low-latency, streaming “voice + chat” agents using OpenAI’s Realtime API combined with the OpenAI Agents SDK. The demo shows patterns for connecting a realtime voice stream (audio in/out) with agents that can use tools, maintain state, and orchestrate multi-agent workflows. The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. The demo...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    BotSharp

    BotSharp

    AI Multi-Agent Framework in .NET

    Conversation as a platform (CaaP) is the future, so it's perfect that we're already offering the whole toolkits to our .NET developers using the BotSharp AI BOT Platform Builder to build a CaaP. It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing technologies, and aims to promote the development and application of intelligent robot assistants in information systems. Out-of-the-box machine learning algorithms allow ordinary programmers to develop artificial intelligence applications faster and easier. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    NodeTool

    NodeTool

    Visual AI Workflow Builder

    NodeTool is an open‑source, visual AI workflow builder that lets you connect nodes for text, images, audio, video, data, and automation—then run them locally or on the cloud. Build multi‑step agents, RAG systems, and creative media pipelines without coding, inspect execution in real time, and deploy anywhere: home server, private VPC, RunPod, or Cloud Run.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 15
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own,...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 16
    ISS: a toolset for visual artists, composers and researchers in the area of: artificial life, sound synthesis, interactive art installations, immersive user interaction, sound spacialization. More information can be found on http://swarms.cc
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 18
    BlueJam is a Java-based algorithmic music composer that uses evolutionary techniques and heuristics. Originally intended to evolve solos on the blues scale. BlueJam interfaces with Pure-Data to give real-time output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A multiagent system with elements of artificial intelligence capable of composing and performing its own music.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    BlueWar is a 3D Multiplayer Real-Time Strategy Game (RTS) that features a futuristic combat on the surfaces of various planets in the universe.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Music agent is a software agent designed to help people discover new creative commons licensed music according to their personal taste.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    This project is an implementation of a computational framework that addresses general-interest low-level problems such as real-time synchronization, sound communication and spatial agent mobility.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB