Open Source Python Multimedia Software - Page 8

Sort By:

Python Multimedia Software

Multimedia Python Clear Filters

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
1

LiVES

LiVES is a Video Editing System. It is designed to be simple to use, y

LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJ's and video editors - mix and switch clips from the keyboard, use dozens of realtime effects, trim and edit your clips in the clip editor, and bring them together using the multitrack timeline. You can even record your performance in real time, and then edit it further or render it straight away. For the more technically minded, the application is frame and sample accurate, and it can be controlled remotely or scripted for use as a video server. And it supports all of the latest free standards.

15 Reviews

Downloads: 12 This Week

Last Update: 2020-11-08
See Project
2

Video Object Remover – Frame-Accurate

🎥 A free & open-source Python tool to remove unwanted objects from videos frame-by-frame using brush masking and AI inpainting (OpenCV + FFmpeg). EXE included.

Video Object Remover – Frame Accurate Edition is a free and open-source desktop application that helps you remove unwanted objects, logos, or watermarks from videos using brush-based masking and AI inpainting. The tool extracts video frames using FFmpeg, lets you mask objects frame-by-frame, and removes them using OpenCV. Built with Python and Tkinter, it features a modern dark-themed GUI, adjustable brush tool, zoom control, and real-time logging. The cleaned video is rebuilt and exported with original quality. Includes a precompiled Windows EXE for normal users (no Python required) and full source code for developers or students. Perfect for YouTubers, video editors, educators, and open-source enthusiasts. 🖥️ Website: https://projectworlds.in 📺 YouTube: https://youtube.com/@projectworlds 📬 Support: https://projectworlds.in/contact-us

Downloads: 59 This Week

Last Update: 2025-06-27
See Project
3

PyMOL Molecular Graphics System

PyMOL is an OpenGL based molecular visualization system

The Open-Source PyMOL repository has been moved to github: https://github.com/schrodinger/pymol-open-source We still use the pymol-users mailing list here on sourceforge. Please subscribe for community support: https://pymol.org/maillist (Note: SourceForge email newsletter and special offers are optional and can be unchecked) The PyMOL community wiki has its own home: https://pymolwiki.org/

Downloads: 57 This Week

Last Update: 2020-06-02
See Project
4

easycap-app

Capture your screen with unprecedented ease and quality.

Welcome to EasyCap, your ultimate desktop screen recorder and screenshot editor. Designed with simplicity and power in mind, EasyCap is perfect for professionals, creators, and anyone looking to capture their PC activities with ease. Whether you're creating tutorials, recording gameplay, or capturing important moments, EasyCap makes it effortless.

1 Review

Downloads: 31 This Week

Last Update: 2024-12-21
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
5

Airtime

Open source broadcast automation software for scheduling and playout

Airtime lets you take total control of your radio station via the web with intelligent archive management, powerful search, an easy playlist builder, a simple scheduling calendar and rock-solid automated playout. Features include Smart Blocks, live assist modes, WAV, FLAC, AAC, MP3 and OGG support, fades, cues, playlists, programme calendar, Icecast, Shoutcast and Soundcloud integration, DJ and station manager roles, JQuery widgets, Liquidsoap playout, and a record and rebroadcast functionality. Powerful media archive features 'watch' folders to allow stations to magically synchronise files, auto-import files and has multiple directory support. Airtime is free to download, runs on Ubuntu and Debian, and users can interact with it through any web browser. Airtime now comes with an 'easy install' package.

7 Reviews

Downloads: 15 This Week

Last Update: 2015-04-08
See Project
6

AV1 AVIF

AV1 Image File Format Specification - ISO-BMFF/HEIF derivative

AV1 AVIF is the official specification and reference design for the AV1 Image File Format (AVIF), defining how AV1-encoded bitstreams are packaged into the HEIF container format (based on ISOBMFF) to produce AVIF files. The project outlines the syntax and semantics required for AVIF compliance, including support for multiple image profiles, color depths, chroma subsampling modes, HDR/WCG, alpha channels, animation/image sequences, and various color-space/bit-depth combinations — making AVIF a versatile, modern image format suitable for both simple photos and advanced imagery needing high fidelity. The specification ensures interoperability across encoders and decoders, providing guidelines so that images created by any compliant AVIF encoder can be reliably decoded by compliant decoders. As adoption grows, AV1 AVIF plays a crucial role in promoting a royalty-free, open, high-efficiency image standard that competes with older formats such as JPEG and newer proprietary ones.

Downloads: 2 This Week

Last Update: 2025-12-08
See Project
7

AudioNotes

Extract audio and video content and organize it into a Markdown note

AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. It may support simple UI for starting/stopping recordings, writing or editing notes, and navigating through recorded sessions.

Downloads: 2 This Week

Last Update: 2026-07-22
See Project
8

Diagrams

Diagram as Code for prototyping cloud system architectures

Diagrams lets you draw the cloud system architecture in Python code. It was born for prototyping a new system architecture without any design tools. You can also describe or visualize the existing system architecture as well. Diagram as Code allows you to track the architecture diagram changes in any version control system. Diagrams currently support main major providers including AWS, Azure, GCP, Kubernetes, Alibaba Cloud, Oracle Cloud, etc. It also supports On-Premise nodes, SaaS and major Programming frameworks and languages. It does not control any actual cloud resources nor does it generate cloud formation or terraform code. It is just for drawing the cloud system architecture diagrams.

Downloads: 2 This Week

Last Update: 2025-11-22
See Project
9

Image Downloader

Download images from Google, Bing, Baidu

Crawl and download images using Selenium Using python3 and PyQt5. Supported Search Engine: Google, Bing, Baidu. Keywords input from the keyboard or input from line separated keywords list file for batch process. Download image using a customizable number of threads. Fully supported conditional search (eg. filetype:, site:). Switch for Google safe mode. Proxy configuration (socks, HTTP). CMD and GUI ways of using are provided.

Downloads: 2 This Week

Last Update: 2023-04-03
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
10

Mesh R-CNN

code for Mesh R-CNN, ICCV 2019

Mesh R-CNN is a 3D reconstruction and object understanding framework developed by Facebook Research that extends Mask R-CNN into the 3D domain. Built on top of Detectron2 and PyTorch3D, Mesh R-CNN enables end-to-end 3D mesh prediction directly from single RGB images. The model learns to detect, segment, and reconstruct detailed 3D mesh representations of objects in natural images, bridging the gap between 2D perception and 3D understanding. Unlike voxel-based or point-based approaches, Mesh R-CNN uses a differentiable mesh representation, allowing it to efficiently refine surface geometry while maintaining high spatial detail. The system combines 2D detection from Mask R-CNN with 3D reasoning modules that output full mesh reconstructions aligned with the input image. It has been evaluated on datasets such as Pix3D, where it demonstrates state-of-the-art performance in reconstructing real-world object geometry.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
11

Mozc Devices

Circuit diagrams and firmware source code for Gboard DIY keyboards

mozc-devices is an open source collection of circuit diagrams, firmware, and technical documentation for a series of experimental and often humorous Gboard and Google Japanese Input hardware keyboards, many of which were originally released as April Fools’ projects by Google Japan. Each subproject in the repository corresponds to a unique input device prototype, including versions such as the Drum Set, Morse Code, Patapata, Magic Hand, Piropiro, Physical Flick, Puchi Puchi, Nazoru, Mageru, Yunomi, Bar, Caps, Double Sided, and Dial editions. These devices creatively reinterpret how users can interact with Japanese text input, blending humor, engineering, and physical computing. The repository serves as an archive of the schematics, firmware, and PCB designs for these inventive input mechanisms, with many projects including promotional videos and technical references.

Downloads: 2 This Week

Last Update: 4 days ago
See Project
12

StoryMem

Official code for StoryMem: Multi-shot Long Video Storytelling

StoryMem is a narrative-focused memory accumulation system that lets users build, store, and reference past conversational context or story elements with an AI, effectively enabling the AI to maintain and recall personalized story memories or character arcs over time. Instead of treating each interaction as stateless, it tracks user-defined memory nodes, tags, and story threads so that future interactions can draw on established narrative context like character traits, past events, or ongoing plotlines. This makes it especially valuable for applications like AI-enriched storytelling, long-term role-playing, personal journaling assistants, or persistent game worlds where continuity and coherence matter. StoryMem includes tools to define, edit, and review saved memories, letting users manage what should be retained, forgotten, or emphasized in later conversations.

Downloads: 2 This Week

Last Update: 2026-07-22
See Project
13

Tilf

Tilf (Tiny Elf) is a free, simple yet powerful pixel art editor

Tilf (Tiny Elf) is a lightweight, cross-platform pixel art editor developed in Python with PySide6, designed for simplicity, speed, and freedom from account systems or installation overhead. It focuses on enabling artists to create sprites, icons, and small 2D assets quickly, without requiring setup, dependencies, or internet connectivity. Tilf provides a familiar drawing environment with essential tools—such as pencil, eraser, fill, eyedropper, rectangle, and ellipse—along with zoom, grid display, real-time preview, and undo/redo capabilities. It supports importing and exporting images in PNG, JPG, and BMP formats, including transparency options. With its single-executable builds for Windows, macOS, and Linux, Tilf can be run instantly and is ideal for both hobbyist pixel artists and developers needing a quick sketching tool for sprite work. The project emphasizes accessibility and minimalism over complexity, making it approachable even for users with no technical background.

Downloads: 2 This Week

Last Update: 2 days ago
See Project
14

Transparent Background

This is a background removing tool powered by InSPyReNet

This is a background-removing tool powered by InSPyReNet (ACCV 2022). You can easily remove the background from the image or video or bunch of other stuffs when you can make the background transparent! We basically follow the virtual camera settings from pyvirtualcam. If you do not choose to install virtual camera, it will visualize real-time output with cv2.imshow. Use another checkpoint file. Default is trained with composite dataset and will be automatically downloaded if not available.

Downloads: 2 This Week

Last Update: 2025-05-14
See Project
15

VSGAN

VapourSynth Single Image Super-Resolution Generative Adversarial

Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. VSGAN is released under the MIT License, ensuring it will stay free, with the ability to be used commercially.

Downloads: 2 This Week

Last Update: 2023-03-29
See Project
16

Verticals v3

Automated YouTube Shorts pipeline

Verticals v3 is an automated content generation workflow designed to create and process YouTube Shorts videos programmatically. It combines multiple tools and scripts to handle tasks such as downloading source material, editing clips, adding subtitles, and formatting output for vertical video platforms. The pipeline emphasizes automation, allowing users to produce short-form content at scale with minimal manual intervention. It integrates FFmpeg and other media processing tools to handle video transformations, resizing, and encoding. The system also supports adding overlays, captions, and audio enhancements to improve engagement. Designed for creators and developers, it enables repeatable workflows for generating social media content efficiently. Its modular structure allows customization of each stage in the pipeline, making it adaptable to different content strategies.

Downloads: 2 This Week

Last Update: 2026-06-09
See Project
17

YouTube-8M

Starter code for working with the YouTube-8M dataset

youtube-8m is Google’s open source starter code and reference implementation for training and evaluating machine learning models on the YouTube-8M dataset, one of the largest video understanding datasets publicly released. The repository provides a complete pipeline for video-level and frame-level modeling using TensorFlow, including data reading, model training, evaluation, and inference. It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on large-scale datasets with over millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.

Downloads: 2 This Week

Last Update: 3 days ago
See Project
18

plotly.py

The interactive graphing library for Python

plotly.py is a browser-based, open source graphing library for Python that lets you create beautiful, interactive, publication-quality graphs. Built on top of plotly.js, it is a high-level, declarative charting library that ships with more than 30 chart types. Everything from statistical charts and scientific charts, through to maps, 3D graphs and animations, plotly.py lets you create them all. Graphs made with plotly.py can be viewed in Jupyter notebooks, standalone HTML files, or hosted online using Chart Studio Cloud.

Downloads: 2 This Week

Last Update: 2026-07-10
See Project
19

rich

Rich is a Python library for rich text and beautiful formatting

The Rich API makes it easy to add color and style to terminal output. Rich can also render pretty tables, progress bars, markdown, syntax highlighted source code, tracebacks, and more, out of the box. Rich is a Python library for rich text and beautiful formatting in the terminal. Rich works with Linux, OSX, and Windows. True color/emoji works with new Windows Terminal, classic terminal is limited to 16 colors. Rich requires Python 3.7 or later. Effortlessly add rich output to your application, you can import the rich print method, which has the same signature as the builtin Python function. Rich can be installed in the Python REPL, so that any data structures will be pretty printed and highlighted. As you might expect, this will print "Hello World!" to the terminal. Note that unlike the builtin print function, Rich will word-wrap your text to fit within the terminal width.

Downloads: 2 This Week

Last Update: 2026-04-12
See Project
20

AnimateDiff

Plug-n-play module turning text-to-image models into animation

AnimateDiff is an open-source project designed to enhance text-to-image diffusion models by adding animation capabilities. It allows users to turn static images generated by popular text-to-image models into animated sequences without requiring additional model training. This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.

1 Review

Downloads: 29 This Week

Last Update: 2025-03-06
See Project
21

imgSeek - Intelligent image database

imgSeek is a photo collection manager and viewer with content-based search and many other features. The query is expressed either as a rough sketch painted by the user or as another image you supply (or an image in your collection).

6 Reviews

Downloads: 11 This Week

Last Update: 2013-04-26
See Project
22

StickyPy

A Stick figure animation program written with Python(www.python.org) and PyGame(www.pygame.org). It is similar to Pivot, except with a much wider variety of animation features, such as keyframes.

1 Review

Downloads: 27 This Week

Last Update: 2013-04-15
See Project
23

Internet DJ Console

A feature packed DJ console and internet radio client for Linux users

Conceived as an internet radio Shoutcast/Icecast client and DJ console IDJC has two main media players, a background track player, effects buttons, crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, voice and VoIP integration. Media file formats include: mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheet support, IRC track and station announcements, uses jack audio connection kit to provide a flexible audio chain. This list of features is by no means complete. More info at the project homepage.

32 Reviews

Downloads: 8 This Week

Last Update: 2026-01-10
See Project
24

Audiere Audio System

Audiere is a portable audio library which supports playing MP3, Ogg Vorbis, FLAC, WAV, IT, XM, S3M, and MOD files. You can use it from C, C++, Python, Java, Delphi, and any language that supports XPCOM (JavaScript in Mozilla, for example).

5 Reviews

Downloads: 10 This Week

Last Update: 2013-04-25
See Project
25

PixelToPath

Convert PNG to SVG with a simple GUI tool.

PixelToPath is an open-source application that converts PNG images into scalable vector graphics (SVG) using the Potrace engine. Designed with simplicity in mind, it provides an intuitive graphical interface to adjust vectorization settings such as smoothing, threshold, and curve precision. PixelToPath is available as a standalone executable for Windows (no Python or installation required) and as a source version for Linux and Windows users who prefer customization. Potrace is fully integrated, allowing offline usage with no extra configuration. Whether you're a designer, developer, or hobbyist, PixelToPath makes bitmap-to-vector conversion fast, accessible, and efficient. The project is hosted on GitHub with source code and releases available for download.

1 Review

Downloads: 24 This Week

Last Update: 2026-03-15
See Project