Open Source Python Multimedia Software - Page 5

Python Multimedia Software

Browse free open source Python multimedia software and projects below.

1

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. It can recognize speech input from the microphone, transcribe an audio file, and save audio data to an audio file. It can also show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details), listen to a microphone in the background, and more. The easiest way to install it is with pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+, which is needed to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.

Downloads: 9 This Week

Last Update: 2026-06-16
2

StreamCap

Multi-Platform Live Stream Automatic Recording Tool

StreamCap is a multi-platform live stream recording client that enables automated capture and management of live broadcasts from over 40 streaming platforms worldwide. Built on FFmpeg and StreamGet, it provides a robust system for monitoring live stream availability and automatically starting recordings when streams go online. The software supports batch recording and scheduled monitoring, allowing users to manage multiple streams simultaneously without manual intervention. It offers a wide range of output formats and includes automatic transcoding features to standardize recordings after capture. StreamCap also integrates notification systems to alert users when streams begin, enhancing real-time responsiveness. Designed for flexibility, it can run on desktop or web environments, making it suitable for both casual archiving and professional content workflows.

Downloads: 9 This Week

Last Update: 2026-07-06
3

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

ffmpeg-normalize is a command-line utility designed to normalize audio levels in media files using FFmpeg, ensuring consistent volume across multiple tracks. It supports both EBU R128 loudness normalization and peak normalization methods, allowing users to choose the appropriate standard for their needs. The tool analyzes audio streams and applies adjustments to achieve target loudness levels without introducing distortion. It can process multiple files in batch mode, making it suitable for large media libraries or production workflows. ffmpeg-normalize also preserves metadata and supports a wide range of input and output formats. Its design emphasizes accuracy and compliance with broadcasting standards. Overall, it provides a reliable solution for achieving consistent audio quality in multimedia content.

Downloads: 9 This Week

Last Update: 2026-07-10
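
The peak normalization mentioned in the ffmpeg-normalize entry above can be illustrated with a small standalone sketch. This is not ffmpeg-normalize's own code and does not call FFmpeg; the function names `peak_gain_db` and `apply_gain` are hypothetical, and the samples are assumed to be floats in the range -1.0 to 1.0. It only shows the arithmetic behind the idea: measure the loudest sample, then apply one constant gain so that the peak lands on a target level in dBFS.

```python
import math


def peak_gain_db(samples, target_peak_db=-1.0):
    """Return the gain (in dB) that moves the loudest sample to target_peak_db.

    Assumes samples are floats where 1.0 is full scale (0 dBFS).
    """
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0:
        # Silence has no defined peak level; leave it untouched.
        return 0.0
    current_peak_db = 20 * math.log10(peak)
    return target_peak_db - current_peak_db


def apply_gain(samples, gain_db):
    """Scale every sample by the linear factor equivalent to gain_db."""
    factor = 10 ** (gain_db / 20)
    return [s * factor for s in samples]


samples = [0.1, -0.25, 0.5, -0.05]
gain = peak_gain_db(samples, target_peak_db=0.0)
normalized = apply_gain(samples, gain)
print(f"gain: {gain:.2f} dB, new peak: {max(abs(s) for s in normalized):.3f}")
```

Because the same gain is applied to every sample, the relative dynamics of the track are unchanged. Loudness normalization such as EBU R128 differs in that it targets perceived loudness over time rather than the single largest sample.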
4

motionEyeOS

A video surveillance OS for single-board computers

motionEyeOS is a Linux distribution that turns a single-board computer into a video surveillance system. The OS is based on BuildRoot and uses motion as a backend and motionEye for the frontend. It is compatible with most USB cameras as well as with the Raspberry Pi camera module. It offers motion detection with email notifications and a working schedule, and stores still images as JPEG files and videos as AVI files. It connects to your local network using Ethernet or Wi-Fi. Files can be stored on an SD card, a USB drive, or a network SMB share. Media files can be uploaded to cloud storage services (Google Drive, Dropbox) and are visible in the local network as SMB shares. Media files can also be accessed through the built-in FTP server or SFTP server.

Downloads: 9 This Week

Last Update: 2021-09-07
5

calibre

calibre - Ebook management

71 Reviews

Downloads: 40 This Week

Last Update: 2015-06-19
6

Monkey-DL

Bulk download your favourite anime episodes from your favourite anime

Monkey-DL is a command-line media downloader designed to retrieve video and audio content from online platforms with flexibility and automation. It integrates with tools like FFmpeg to handle post-processing tasks such as merging streams, converting formats, and optimizing output quality. The tool supports downloading single media files or entire playlists, enabling efficient batch operations. It includes options for selecting resolution, format, and output structure, giving users fine control over downloads. Monkey-DL is built for simplicity, providing straightforward commands while still supporting advanced configurations. Its lightweight design makes it suitable for scripting and integration into automation workflows. Overall, it serves as a practical solution for downloading and organizing online media content.

Downloads: 8 This Week

Last Update: 2026-04-28
7

Unrud Video Downloader

Download videos from websites like YouTube and many others

Video Downloader is a desktop application designed to simplify the process of downloading videos from various online platforms through a user-friendly graphical interface. Built on top of yt-dlp, it abstracts the complexity of command-line tools and provides an accessible way for users to retrieve video and audio content. The application supports a wide range of features, including downloading entire playlists, handling private or password-protected content, and automatically selecting optimal formats based on user preferences. It also allows users to convert videos into audio files such as MP3, making it useful for media extraction workflows. The software is distributed across multiple platforms, including Linux package managers and containerized environments, ensuring broad accessibility. It includes configuration options and debugging capabilities for advanced users who want more control over the download process.

Downloads: 8 This Week

Last Update: 2026-04-09
8

nunif

Misc; latest version of waifu2x; 2D video to stereo 3D video

nunif is a deep learning–based image processing framework focused on image upscaling, restoration, denoising, and enhancement tasks using neural network models. The project provides a collection of AI-powered utilities designed primarily for anime-style artwork, illustrations, and high-quality image restoration workflows. It includes command-line tools and graphical interfaces for applying trained neural models to improve image resolution and visual clarity while minimizing artifacts. nunif supports GPU acceleration and batch processing, making it suitable for creators, archivists, and enthusiasts handling large image collections. The framework is highly modular, allowing developers to experiment with custom models, inference pipelines, and image-processing workflows. Its emphasis on anime and illustration enhancement has made it especially popular in digital art and media preservation communities.

Downloads: 8 This Week

Last Update: 2026-07-17
9

SMILI

Scientific Visualisation Made Easy

The Simple Medical Imaging Library Interface (SMILI), pronounced 'smilie', is an open-source, lightweight and easy-to-use medical imaging viewer and library for all major operating systems. The main sMILX application supports viewing n-D images, vector images, DICOMs, and models/surfaces, as well as anonymizing and shape analysis, all with easy drag and drop functions. It also features a number of standard processing algorithms for images and models, such as smoothing, thresholding, and masking, available through graphical user interfaces and/or the command line. See our YouTube channel for tutorial videos via the homepage. The applications are all built out of a uniform user-interface framework that provides a very high level (Qt) interface to powerful image processing and scientific visualisation algorithms from the Insight Toolkit (ITK) and Visualisation Toolkit (VTK). The framework allows one to build stand-alone medical imaging applications quickly and easily.

Downloads: 192 This Week

Last Update: 2026-03-16
10

ReClip

Download videos from almost any website

ReClip is a lightweight, self-hosted media downloader that provides a simple web-based interface for downloading videos and audio from a wide range of online platforms. Built around the yt-dlp engine, it supports over a thousand websites, including major platforms like YouTube, TikTok, and Instagram, allowing users to retrieve media content in various formats. The application emphasizes simplicity and minimalism, featuring a clean interface built with plain HTML, CSS, and JavaScript without requiring complex build steps or frameworks. Its backend is implemented in a compact Python Flask application, making it easy to deploy and customize. Users can paste multiple URLs at once, select output formats such as MP4 or MP3, and choose quality settings before downloading. The system also includes features like automatic URL deduplication and batch processing to improve usability.

Downloads: 7 This Week

Last Update: 2026-07-09
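
The URL deduplication mentioned in the ReClip entry above could work along the following lines. This is a minimal sketch under stated assumptions, not ReClip's actual implementation: the helper names `normalize` and `dedupe_urls` are hypothetical, and the choice to treat URLs as equal when they differ only in host or scheme case, surrounding whitespace, or the `#fragment` part is an assumption made for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize(url):
    """Build a comparison key: lowercase scheme and host, drop the fragment."""
    parts = urlsplit(url.strip())
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), parts.path, parts.query, "")
    )


def dedupe_urls(urls):
    """Return the URLs in their original order, keeping the first of each key."""
    seen = set()
    unique = []
    for url in urls:
        key = normalize(url)
        if key and key not in seen:  # skip blank lines and repeats
            seen.add(key)
            unique.append(url.strip())
    return unique


pasted = [
    "https://Example.com/watch?v=1",
    "https://example.com/watch?v=1#t=10",
    " https://example.com/watch?v=2 ",
    "",
]
print(dedupe_urls(pasted))
```

Keeping the first occurrence and preserving input order means a batch is processed in the order the user pasted it, with repeats silently skipped.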
11

Web RPA

Web Robotics Process Automation Tool

Web RPA is a browser automation framework designed to perform robotic process automation tasks directly within web environments. It enables users to automate repetitive actions such as form filling, data extraction, and workflow execution through programmable scripts. The system focuses on simplicity and flexibility, allowing automation without requiring complex infrastructure. It supports interaction with web elements, navigation flows, and dynamic content handling, making it suitable for scraping and automation scenarios. Web RPA can be integrated into larger systems or used as a standalone tool for automating browser-based operations. Its lightweight design ensures efficient execution while maintaining adaptability for different use cases. Overall, it provides a practical solution for automating web workflows and repetitive tasks.

Downloads: 7 This Week

Last Update: 3 days ago
12

kemono-dl

A simple downloader for kemono/coomer using python

kemono-dl is a Python-based downloader for Kemono and Coomer content archives. It is designed to save posts, attachments, separate files, metadata, and related creator content from supported pages. The project requires Python 3.11 or later and can be installed from the source package with pip. Its release history shows options for skipping attachments, writing post content, customizing output templates, and handling separate file types. The tool is mainly useful for personal archiving and structured downloading workflows. Because it can interact with creator content archives, users should consider copyright, consent, platform rules, and local laws before downloading or redistributing anything.

Downloads: 7 This Week

Last Update: 2026-07-06
13

Music Player Daemon

This SourceForge project page is obsolete. Please visit http://www.musicpd.org/

4 Reviews

Downloads: 45 This Week

Last Update: 2016-05-21
14

Curlew Multimedia Converter

Easy to use Multimedia Converter for Linux

8 Reviews

Downloads: 43 This Week

Last Update: 2018-05-26
15

DarkAudacity

A customized version of Audacity

A free sound editor, DarkAudacity is the well-known Audacity sound editor with a darker, more modern theme and a few small tweaks. The audio engine underneath is the same; the same code powers it. Like Audacity, it is completely free. It's not a cut-down trial or evaluation version. You can record and play sounds, edit sounds, apply audio effects and save what you create for ringtones, podcasts and more. DarkAudacity is Open Source, free for you to download and use on your PC. Audacity and DarkAudacity come from a community effort. Many people have contributed to the audio code. Because they are Open Source, anyone is allowed to read and modify the source code. DarkAudacity is a variation on the Audacity software, made possible because Audacity is Open Source.

2 Reviews

Downloads: 57 This Week

Last Update: 2023-10-06
16

3D Gaussian Splatting

Original reference implementation of "3D Gaussian Splatting"

Gaussian Splatting is the official implementation of “3D Gaussian Splatting for Real-Time Radiance Field Rendering,” a research project for reconstructing and rendering 3D scenes from collections of images. The system represents scenes as millions of optimized 3D Gaussians rather than traditional meshes or neural fields, allowing high-quality novel view synthesis with real-time rendering performance. It includes training scripts, rendering tools, scene conversion utilities, and viewers for inspecting generated results. The project is widely used in computer graphics, spatial capture, virtual production, research, and experimental 3D reconstruction workflows. It relies on image-based reconstruction pipelines such as COLMAP to estimate camera positions before optimizing the Gaussian representation. Overall, Gaussian Splatting has become a foundational reference implementation for modern real-time radiance field rendering.

Downloads: 6 This Week

Last Update: 2026-05-08
17

Animated Drawings

Code to accompany "A Method for Animating Children's Drawings"

AnimatedDrawings is a framework that converts user sketches or line drawings into fully animated 2D motion sequences using learned motion priors. The idea is that you draw a simple static figure (stick figure, silhouette, or contour lines), and the system produces plausible skeletal motion (walking, jumping, dancing) that adheres to the drawn shape constraints. The architecture separates shape embedding (to understand user-drawn geometry) from motion embedding / generation (to produce temporally coherent movement). Users can provide rough keyframes or control constraints (pose anchors), and the system fills intermediate frames with fluid animation. The repository includes demonstration apps and notebooks where you can upload or draw shapes and watch animations play. Because the approach is data-driven, it generalizes to new drawings even with varying proportions or stylizations.

Downloads: 6 This Week

Last Update: 2025-10-07
18

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.

Downloads: 6 This Week

Last Update: 2026-04-28
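
The subtitle-generation step described in the AutoSub entry above ends with writing transcribed segments to a standard subtitle format. The following sketch shows what that could look like for SRT. It is not AutoSub's code: the function names `srt_timestamp` and `to_srt` are hypothetical, and the input is assumed to be a list of `(start_seconds, end_seconds, text)` tuples that a speech recognizer has already produced.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render (start, end, text) segments as numbered SRT cue blocks."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


segments = [(0.0, 1.5, "Hello"), (1.5, 3.0, "World")]
print(to_srt(segments))
```

SRT uses a comma before the milliseconds, whereas WebVTT uses a period, so a VTT writer would need a slightly different timestamp helper plus a `WEBVTT` header line.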
19

Google Photos Sync

Google Photos and Albums backup with Google Photos Library API

Google Photos Sync is a backup tool for your Google Photos cloud storage. It downloads all photos and videos the user has uploaded to Google Photos and organizes the media in the local file system using album information. Additional Google Photos 'Creations' such as animations, panoramas, movies, effects and collages are also backed up. This software is read-only and never modifies your cloud library in any way, so there is no risk of damaging your data. There are a number of long-standing issues with the Google Photos API that mean it is not possible to make a true backup of your media.

Downloads: 6 This Week

Last Update: 2024-07-14
20

Mirrorcast

Open Source Alternative to Chromecast, Mirror Desktop and Play media r

The idea is to replicate what Chromecast can do with regard to screen mirroring and streaming media to a remote display. Google Chrome's screen mirroring feature works well when used with a receiver such as Chromecast, but this is a proprietary solution and audio does not work for desktop mirroring on some operating systems. At the moment, there is only a client for Debian/Ubuntu operating systems and a server/receiver application for Raspberry Pi. Mirrorcast aims to be a low-latency screen mirroring solution with high-quality video and audio at 25-30fps; the latter is why we will not use something like VNC. Mirrorcast uses about the same amount of system resources as Google Chrome's cast feature. The delay is less than 1 second on most networks. To achieve this we will use existing FOSS software such as ffmpeg, mpv, and omxplayer.

Downloads: 6 This Week

Last Update: 2023-08-04
21

auto-subtitle

Automatically generate and overlay subtitles for any video

auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. The system can also translate subtitles into English, enabling multilingual accessibility for video content. Once the required models are downloaded, it can operate offline, making it practical for local workflows. Designed for simplicity, it provides a streamlined way to automate subtitle creation without manual transcription effort.

Downloads: 6 This Week

Last Update: 2026-04-24
22

pytube

A lightweight, dependency-free Python library

Pytube is a lightweight, dependency-free Python library that enables downloading YouTube videos and audio streams with minimal setup. It supports video resolution selection, progressive or adaptive streams, and caption downloads. Pytube is ideal for automation scripts, archiving tools, and media applications that need to interface with YouTube content programmatically.

Downloads: 6 This Week

Last Update: 2025-07-01
23

Gnuplot.py

A Python interface to the gnuplot plotting program.

8 Reviews

Downloads: 31 This Week

Last Update: 2012-12-06
24

CamDesk

The Desktop Webcam Widget

CamDesk is a free, open source desktop webcam widget that was created as a home surveillance application, although others have used it for demonstrations, even alongside CamStudio and QuickTime Player for screencasting.

7 Reviews

Downloads: 42 This Week

Last Update: 2016-03-08
25

Natron

Open source, cross-platform compositing software

Natron is open-source, cross-platform nodal compositing software.

Downloads: 150 This Week

Last Update: 2024-09-13