Search Results for "audio video streaming server source code" - Page 2

Showing 103 open source projects for "audio video streaming server source code"

1

DocArray

The data structure for multimodal data

DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. A door to the multimodal world: a super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. Data...

Downloads: 0 This Week

Last Update: 2025-03-21
See Project
2

Dia

A TTS model capable of generating ultra-realistic dialogue

Dia is a neural text-to-speech model designed specifically for generating ultra-realistic dialogue in a single pass. Instead of focusing on isolated sentences or flat narration, it is optimized for conversational audio, complete with natural turn-taking, prosody, and pacing. The model can be conditioned on a reference audio sample, allowing you to control emotion, tone, and other stylistic aspects of the speech. It can also produce nonverbal vocalizations like laughter, coughs, clearing the...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
3

notebooklm-py

Unofficial Python API and agentic skill for Google NotebookLM

notebooklm-py is an unofficial Python API and agent-ready integration layer for Google NotebookLM that exposes NotebookLM functionality through code, the command line, and AI agent workflows. Its goal is to provide programmatic access not just to standard notebook operations, but also to many capabilities that are either limited or unavailable in the web interface, making it especially useful for automation and custom pipelines. The project covers notebook management, source ingestion,...

Downloads: 6 This Week

Last Update: 2026-03-17
See Project
4

ComfyUI

The most powerful and modular diffusion model GUI, api and backend

The most powerful and modular diffusion model GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...

Downloads: 180 This Week

Last Update: 2026-03-24
See Project
5

Jina

Build cross-modal and multimodal applications on the cloud

Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...

Downloads: 0 This Week

Last Update: 2024-11-12
See Project
6

Bulk Image and Audio Merger

Easily pair images with audio file counterparts in bulk

...Works on Windows 7 and above. (Source code is included with the program .zip file.)

Downloads: 0 This Week

Last Update: 2025-07-27
See Project
7

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak...

Downloads: 15 This Week

Last Update: 2025-11-28
See Project
8

FastKoko

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

FastKoko is a self-hosted text-to-speech server built around the Kokoro-82M model and exposed through a FastAPI backend. It is designed to be easy to deploy via Docker, with separate CPU and GPU images so that users can choose between pure CPU inference and NVIDIA GPU acceleration. The project exposes an OpenAI-compatible speech endpoint, which means existing code that talks to the OpenAI audio API can often be pointed at a Kokoro-FastAPI instance with minimal changes. It supports multiple...

Downloads: 1 This Week

Last Update: 2025-12-13
See Project
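
The FastKoko entry above mentions an OpenAI-compatible speech endpoint. As a rough sketch of what "pointing existing code at it" could look like, the snippet below builds (without sending) a request in the shape of the OpenAI `/v1/audio/speech` API. The host, port, model name, and voice name are placeholders assumed for illustration; they are not taken from the listing, so check the project's own documentation for real values.

```python
import json
import urllib.request

# Assumptions (not from the listing): server address, model name, voice name.
BASE_URL = "http://localhost:8880"  # hypothetical local server address


def build_speech_request(text, base_url=BASE_URL, model="kokoro", voice="af_bella"):
    """Build (but do not send) a POST request for an OpenAI-style speech endpoint."""
    payload = {
        "model": model,            # hypothetical model identifier
        "input": text,             # text to synthesize
        "voice": voice,            # hypothetical voice name
        "response_format": "mp3",
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_speech_request("Hello from a self-hosted TTS server.")
print(req.get_method(), req.full_url)

# Sending requires a running server, so it is left commented out:
# with urllib.request.urlopen(req) as resp, open("out.mp3", "wb") as fh:
#     fh.write(resp.read())
```

Only the base URL differs from a request aimed at a hosted API, which is the sense in which such a server can often be swapped in with minimal changes.
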
9

Werkzeug

The comprehensive WSGI web application library

Werkzeug is a comprehensive WSGI web application library. It began as a simple collection of various utilities for WSGI applications and has become one of the most advanced WSGI utility libraries. Werkzeug doesn’t enforce any dependencies. It is up to the developer to choose a template engine, database adapter, and even how to handle requests. Includes an interactive debugger that allows inspecting stack traces and source code in the browser with an interactive interpreter for any frame in...

Downloads: 1 This Week

Last Update: 3 days ago
See Project
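
For readers unfamiliar with the WSGI interface that the Werkzeug entry above refers to, here is a minimal sketch of a bare WSGI application, using only the Python standard library. It does not use Werkzeug's own API; the function names `app` and `call_app` are illustrative.

```python
from wsgiref.util import setup_testing_defaults


def app(environ, start_response):
    """A bare WSGI application: receives the request environ, returns an iterable of bytes."""
    path = environ.get("PATH_INFO", "/")
    body = f"You requested {path}\n".encode("utf-8")
    start_response("200 OK", [
        ("Content-Type", "text/plain; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ])
    return [body]


def call_app(path):
    """Invoke the app in-process with a synthetic environ (no network needed)."""
    environ = {}
    setup_testing_defaults(environ)  # fills in the required WSGI keys
    environ["PATH_INFO"] = path
    captured = {}

    def start_response(status, headers, exc_info=None):
        captured["status"] = status
        captured["headers"] = headers

    body = b"".join(app(environ, start_response))
    return captured["status"], body


status, body = call_app("/hello")
print(status, body)
```

Libraries in this space wrap the raw `environ` dictionary and `start_response` callback in request and response objects, so application code does not have to handle them directly.
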
10

Scrapling

An adaptive Web Scraping framework

Scrapling is an adaptive web scraping framework designed to handle everything from a single HTTP request to large-scale, concurrent crawls. Built for modern websites, it intelligently adapts to structural changes by automatically relocating elements when page layouts update. The framework includes advanced fetchers capable of bypassing anti-bot protections such as Cloudflare Turnstile using stealth and browser automation techniques. Its powerful spider system supports multi-session crawling,...

Downloads: 2 This Week

Last Update: 1 day ago
See Project
11

Deep Lake

Data Lake for Deep Learning. Build, manage, and query datasets

Deep Lake (formerly known as Activeloop Hub) is a data lake for deep learning applications. Our open-source dataset format is optimized for rapid streaming and querying of data while training models at scale, and it includes a simple API for creating, storing, and collaborating on AI datasets of any size. It can be deployed locally or in the cloud, and it enables you to store all of your data in one place, ranging from simple annotations to large videos. Deep Lake is used by Google, Waymo,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
12

FlowLens MCP

Open-source MCP server that gives your coding agent

FlowLens MCP Server is an open-source tool designed to give AI-powered coding agents (like Claude Code, Cursor, GitHub Copilot / Codex, and others) full, replayable browser context to dramatically improve debugging, bug reporting, and regression testing for web applications. It works together with a companion browser extension: when a user reproduces a bug or a complicated UI interaction, the extension captures a rich session log, including screen/video recording, network traffic, console logs, DOM events, storage changes, and more, and exports it. ...

Downloads: 0 This Week

Last Update: 2025-12-05
See Project
13

MagicBox Player

Magic Box 🎶: The Open-Source Multimedia Player

Magic Box is a versatile, custom-built media player for desktop environments, blending a classic interface with powerful, modern features. Developed in Python with PyQt5, it supports a wide range of audio and video formats.

Key Features:
Dynamic Visualizer: Features a real-time, custom FFT audio spectrum visualizer that monitors system loopback audio, providing vibrant, data-driven feedback (requires manual loopback setup like Stereo Mix/PulseAudio). ...

Downloads: 2 This Week

Last Update: 2026-03-07
See Project
14

WebCam_Streamer

Stream your laptop or external webcam over LAN or Wi-Fi hotspot.

WebCam Streamer is a professional, lightweight application that lets you stream your laptop or external webcams in real-time. Stream over a local network (LAN) or create a temporary Wi-Fi hotspot for direct access without complex setup. Features a modern, resizable dark-themed GUI with a dropdown to select any connected camera, live status display, and QR code for instant browser access. Supports multiple webcams, switching between them seamlessly, and keeps streaming simple with start/stop...

Downloads: 4 This Week

Last Update: 2025-10-04
See Project
15

HunyuanOCR

OCR expert VLM powered by Hunyuan's native multimodal architecture

...HunyuanOCR handles complex documents: multi-column layouts, tables, mathematical formulas, mixed languages, handwritten or stylized fonts, receipts, tickets, and even video-frame subtitles. The project provides code, pretrained weights, and inference instructions, making it feasible to deploy locally or on a server, and to integrate with applications.

Downloads: 0 This Week

Last Update: 2026-01-13
See Project
16

OpenBroadcaster Obplayer

Broadcast Automation Emergency Alerting

OpenBroadcaster has a simple and effective web-based automation system with a scheduler to conveniently manage your station from just about anywhere, through the creation of content lists that can play out over both terrestrial and internet radio and TV, using a mobile-friendly 'drag-and-drop' interface. Setting up repeating content, such as station identification, public service announcements, or advertisements, is "point and click" easy.

Downloads: 3 This Week

Last Update: 2026-01-24
See Project
17

TensorBoardX

tensorboard for pytorch (and chainer, mxnet, numpy, etc.)

The SummaryWriter class provides a high-level API to create an event file in a given directory and add summaries and events to it. The class updates the file contents asynchronously. This allows a training program to call methods to add data to the file directly from the training loop, without slowing down training. TensorBoardX now supports logging directly to Comet. Comet is a free cloud-based solution that allows you to automatically track, compare and explain your experiments. It adds a...

Downloads: 0 This Week

Last Update: 2025-06-10
See Project
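
The TensorBoardX entry above describes a writer that updates its file asynchronously so the training loop is not slowed down. The sketch below illustrates that general pattern (a queue drained by a background thread) using only the standard library. `AsyncLineWriter` is a hypothetical class written for this example; it is not TensorBoardX's implementation and does not produce TensorBoard event files.

```python
import os
import queue
import tempfile
import threading


class AsyncLineWriter:
    """Hypothetical helper: queues records and writes them on a background thread."""

    _STOP = object()  # sentinel that tells the worker thread to exit

    def __init__(self, path):
        self._queue = queue.Queue()
        self._file = open(path, "a", encoding="utf-8")
        self._thread = threading.Thread(target=self._drain, daemon=True)
        self._thread.start()

    def add_scalar(self, tag, value, step):
        # Returns immediately; the caller never waits on disk I/O.
        self._queue.put(f"{step}\t{tag}\t{value}\n")

    def _drain(self):
        # Runs on the background thread until the sentinel arrives.
        while True:
            item = self._queue.get()
            if item is self._STOP:
                break
            self._file.write(item)

    def close(self):
        # Flush everything queued so far, then stop the worker and close the file.
        self._queue.put(self._STOP)
        self._thread.join()
        self._file.close()


path = os.path.join(tempfile.mkdtemp(), "events.tsv")
writer = AsyncLineWriter(path)
for step in range(3):  # stand-in for a training loop
    writer.add_scalar("loss", 1.0 / (step + 1), step)
writer.close()

with open(path, encoding="utf-8") as fh:
    lines = fh.read().splitlines()
print(lines)
```

The design choice to note is that `add_scalar` only enqueues a string, so the cost inside the loop stays small regardless of how slow the disk is.
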
18

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 5 This Week

Last Update: 2025-12-31
See Project
19

KCast

chromecast videos from KDE desktop

KCast is a KDE Plasma 6 widget that lets you cast video files or YouTube URLs to Chromecast devices on your local network. It supports device discovery, local media playback via an embedded HTTP server, and drag-and-drop integration with browsers and file managers like Dolphin.

Downloads: 0 This Week

Last Update: 2025-07-24
See Project
20

Dynamite Download Manager

PyIDM remake for downloading stuff

Dynamite Download Manager is a powerful download manager equipped with multi-connections and a high-speed engine, designed to enhance your downloading experience. By utilizing multiple connections, DDM splits files into smaller segments and downloads them simultaneously, significantly increasing download speeds. Its advanced high-speed engine ensures faster and more efficient downloading, even for large files. DDM supports a wide variety of file formats, enabling you to download general...

Downloads: 4 This Week

Last Update: 2025-09-08
See Project
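
The Dynamite Download Manager entry above describes splitting a file into segments that are downloaded simultaneously. One common way to do this is with HTTP `Range` requests; the sketch below computes the byte ranges for such requests. It is an illustration of the general technique, assuming the server supports range requests, and is not DDM's actual code. The function names are hypothetical.

```python
def split_ranges(total_size, segments):
    """Return inclusive (start, end) byte ranges that together cover total_size bytes."""
    if total_size <= 0 or segments <= 0:
        raise ValueError("total_size and segments must be positive")
    segments = min(segments, total_size)  # never create empty segments
    base, extra = divmod(total_size, segments)
    ranges = []
    start = 0
    for i in range(segments):
        # The first `extra` segments each take one leftover byte.
        length = base + (1 if i < extra else 0)
        end = start + length - 1
        ranges.append((start, end))
        start = end + 1
    return ranges


def range_header(start, end):
    """Build the HTTP header asking for bytes start..end (inclusive)."""
    return {"Range": f"bytes={start}-{end}"}


for start, end in split_ranges(1000, 3):
    print(range_header(start, end))
```

Each range could then be fetched on its own connection and the parts written to the output file at their respective offsets.
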
21

Spyder AI Chat Plugin

Spyder IDE plugin providing separate chat pane for AI Assistance

An OpenAI-compatible chat pane for Spyder 6.x. Supports OpenAI, Ollama, LM Studio, and any other server that exposes an OpenAI-compatible /v1/chat/completions endpoint.

Installation from PyPI, in the same environment as Spyder IDE:

(spyder) $ pip install spyder-ai-chat

Or from source:

# clone / download and unzip the project source code, then:
(spyder) $ cd spyder_ai_chat
(spyder) $ pip install -e .

Downloads: 0 This Week

Last Update: 2026-03-26
See Project
22

How to get Bot Lobbies

Get easy bot lobbies in any game with our bot lobbies tool.

Test it on the web app: https://slothytech.com/ezlobbies/ EzLobbies is the open-source tool developed by SlothyTech that helps gamers get easier, bot-filled lobbies in the world’s top games. Instantly generate custom OpenVPN config files that route only your game’s matchmaking traffic through the VPN server of your choice—no complicated setup and no speed loss for browsing or streaming. Supported titles include Fortnite, Call of Duty, Warzone, Apex Legends, Overwatch, Valorant, PUBG, and...

2 Reviews

Downloads: 6 This Week

Last Update: 2025-06-14
See Project
23

ufonet

UFONet - Denial of Service Toolkit

UFONet is a set of hacktivist tools that allow launching coordinated DDoS and DoS attacks and combining both in a single offensive. It also works as an encrypted DarkNET to publish and receive content by creating a global client/server network based on a direct-connect P2P architecture.

FAQ: https://ufonet.03c8.net/FAQ.html

UFONet-v1.8 [DPh] "DarK-PhAnT0m!" (.zip), md5 = [ c8ab016f6370c8391e2e6f9a7cbe990a ]
UFONet-v1.8...

Downloads: 9 This Week

Last Update: 2024-08-23
See Project
24

Errbot

Chatbot daemon that connects to your favorite chat services

Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools into the conversation. The goal of the project is to make it easy for you to write your own plugins so you can make it do whatever you want: run a deployment, retrieve some information online, trigger a tool via an API, troll a co-worker, etc. Errbot is being used in a lot of different contexts: chatops (tools for devops), online gaming chatrooms like EVE, video streaming chatrooms like...

Downloads: 0 This Week

Last Update: 2024-01-01
See Project
25

Demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Demucs (Deep Extractor for Music Sources) is a deep-learning framework for music source separation—extracting individual instrument or vocal tracks from a mixed audio file. The system is based on a U-Net-like convolutional architecture combined with recurrent and transformer elements to capture both short-term and long-term temporal structure. It processes raw waveforms directly rather than spectrograms, allowing for higher-quality reconstruction and fewer artifacts in separated tracks. The...

Downloads: 84 This Week

Last Update: 2025-10-12
See Project