Open Source Python Multimedia Software - Page 9

Sort By:

Python Multimedia Software

Multimedia Python Clear Filters

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
1

SPPAS

SPPAS - the automatic annotation and analyses of speech

SPPAS is a scientific computer software package written and maintained by Brigitte Bigi of the Laboratoire Parole et Langage, in Aix-en-Provence, France. Available for free, with open source code, there is simply no other package for linguists to simple use in the automatic annotations of speech, the analyses of any kind of annotated data and the conversion of annotated files. SPPAS is able to produce automatically speech annotations from a recorded speech sound and its orthographic transcription. SPPAS is helpful for the analysis of any annotated data: estimate statistical distributions, make requests, manage files, visualize annotations. SPPAS offers a file converter from/to a wide range of formats: xra, TextGrid, eaf, trs... <https://sppas.org>

Downloads: 43 This Week

Last Update: 3 days ago
See Project
2

Slim Camera

Slim Camera - Lightweight RTSP Video Player

Slim Camera is a lightweight RTSP viewer for IP cameras. On first launch, it prompts for the stream URL (saved for future sessions) and runs in the system tray to avoid taskbar clutter. It remembers window position, size, and camera URL via an INI file for seamless reuse. The interface keeps distractions minimal - just the video stream in an auto-sizing window. Right-click the tray icon to change the camera URL, restart the stream, reset window position, or exit. Press F1 to quickly modify the RTSP address. Optimized for low resource usage, it works reliably even on older hardware, making it perfect for background monitoring. With portable settings (single INI file) and focus on core functionality, Slim Camera delivers no-fuss video streaming for users who value simplicity. Support its free, open-source development with a donation at https://boosty.to/slim-camera/donate to help keep it ad-free and growing!

Downloads: 43 This Week

Last Update: 2025-06-19
See Project
3

glewpy

GLEWpy aims to bring advanced OpenGL extensions to Python. This allows the Python OpenGL developer to use features such as fragment/vertex shaders and image processing on the GPU. It serves as a compliment to PyOpenGL and toolkits such as GLUT and SDL.

Downloads: 42 This Week

Last Update: 2013-04-17
See Project
4

MLT Multimedia Framework

A multimedia authoring and processing framework and a video playout server for television broadcasting.

17 Reviews

Downloads: 7 This Week

Last Update: 2026-06-25
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
5

PyKaraoke

PyKaraoke is a cross-platform karaoke player. It currently supports CDG (MP3+G, OGG+G, WAV+G), MIDI (.KAR, .MID) and MPEG formats.

5 Reviews

Downloads: 9 This Week

Last Update: 2013-04-25
See Project
6

puddletag

SImple, powerful audio tagger for GNU/Linux

puddletag is an audio tag editor (primarily created) for GNU/Linux similar to the Windows program, Mp3tag. Unlike most taggers for GNU/Linux, it uses a spreadsheet-like layout so that all the tags you want to edit by hand are visible and easily editable. The usual tag editor features are supported like extracting tag information from filenames, renaming files based on their tags by using patterns and basic tag editing. Then there’re Functions, which can do things like replace text, trim it, do case conversions, etc. Actions can automate repetitive tasks. Doing web lookups using Amazon (including cover art), Discogs (does cover art too!), FreeDB and MusicBrainz is also supported. There’s quite a bit more, but I’ve reached my comma quota. Supported formats: ID3v1, ID3v2 (mp3), MP4 (mp4, m4a, etc.), VorbisComments (ogg, flac), Musepack (mpc), Monkey’s Audio (.ape) and WavPack (wv).

12 Reviews

Downloads: 7 This Week

Last Update: 2015-10-12
See Project
7

UniConvertor

Universal graphics translator

UniConvertor is an universal graphics translator. The project uses sK1 engine to convert one format to another. It has an import filters for: SVG, CDR, CDT, CMX, AI, XAR, CGM, WMF, XFIG, SK, SK1, SK2, CPL, ASE, ACO, JCW, GPL, SOC, SKP, PSD, XCF, PNG, JPG, TIFF, WEBP, BMP, PCX, PPM, XBM, XPM and export filters: SVG, AI, CDR, CMX, PDF, SK, SK1, SK2, CGM, WMF, CPL, ASE, ACO, JCW, GPL, SOC, SKP, PNG. This SourceForge project page is outdated. To download latest UniConvertor binaries, please visit official project site: https://sk1project.net/uc2/

4 Reviews

Downloads: 16 This Week

Last Update: 2019-11-17
See Project
8

Linux-Intelligent-Ocr-Solution

Easy-OCR solution and Tesseract trainer for GNU/Linux

Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial : https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract Training Tutorial (beta) : https://www.youtube.com/watch?v=qLpCld4cdtk Source Code Github : https://github.com/Nalin-x-Linux/lios-3 Gitlab : https://gitlab.com/Nalin-x-Linux/lios-3 User guide is available in download page

5 Reviews

Downloads: 13 This Week

Last Update: 2020-10-19
See Project
9

Easy Background Remover

Free offline background remover for Windows - one click, no watermark

Easy Background Remover is a free, offline, AI-powered background remover for Windows 10 and 11. Drop in a photo and the background disappears in one click, leaving a clean transparent PNG. Your images never leave your computer - there is no upload, no sign-up, no account, no watermark and no limits, ever. The app runs the open source u2net neural network locally on your CPU through onnxruntime, so it works on any modern Windows PC and does not need a GPU. Pick a single photo or drop a whole folder for batch processing. The cutout pipeline mirrors the default u2net path of the well-known rembg library, so the quality matches what you would get from popular online background removers - without giving up your photos, paying for full resolution, or watching ads. The installer is tiny, has no telemetry, places a single desktop shortcut and removes itself after the first run. Private, free, unlimited, and released as free open source software under the MIT license.

Downloads: 37 This Week

Last Update: 1 day ago
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
10

EventGhost

EventGhost is an automation tool for MS Windows, that can be extended through plug-ins. Please visit http://www.eventghost.net/ to find more info and the latest release.

Downloads: 37 This Week

Last Update: 2016-10-04
See Project
11

MediaCrate — Video/Audio Downloader

Download video and audio from over 1,000+ websites with one click

MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It's rather simple to use. Paste a link, select format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads. Project links: Website: justagwas.com/projects/mediacrate GitHub: github.com/Justagwas/mediacrate Documentation: github.com/Justagwas/mediacrate/wiki The application is fully open source, runs entirely on your device, and only downloads content you explicitly request. VirusTotal scan result: https://www.virustotal.com/gui/file/aef68f8fb5c99a02072af39038f9c22113141239abd75f4cf4ff52d6ee486ba3

2 Reviews

Downloads: 13 This Week

Last Update: 2026-07-04
See Project
12

Quivi

This project is not maintained anymore. A new maintainer has been updating it in https://github.com/qazmlpok/quivi, but I'm not involved with it. Quivi is a comic / manga reader but also an general purpose image viewer for Windows which supports many file formats and compressed (zip, rar) files. It is aimed for fast & easy file browsing with keyboard or mouse.

8 Reviews

Downloads: 9 This Week

Last Update: 2024-11-04
See Project
13

Formatix Image Converter

Formatix Image Converter is a free batch image converter and resizer for Windows. Supports HEIC, AVIF, WebP, SVG, JPEG, PNG, BMP, TIFF, GIF and ICO. Works 100% offline — no uploads, no internet required, no file limits, no ads. Key Features: - Batch conversion — hundreds of images in one operation - Multi-threaded processing for fast performance - 5 resize modes including smart crop - Adjustable quality (JPEG, WebP, AVIF, HEIC) - Drag & Drop support - 6 interface languages: English, Русский, Українська, Deutsch, Suomi, 中文 - Light / Dark theme - Built-in side-by-side image comparison - No installation needed — just run the .exe Input Formats: JPEG, PNG, WebP, AVIF, HEIC/HEIF, SVG, BMP, TIFF, GIF, ICO Output Formats: AVIF, WebP, JPEG, HEIC, PNG, BMP, TIFF, ICO

Downloads: 35 This Week

Last Update: 2026-07-13
See Project
14

Kirstens Viewers

Opensource Created Custom Viewers For Virtual Worlds like SecondLife.

Kirsten’s Viewer is a fast, modern Third‑Party Viewer (TPV) for Second Life, registered under the official TPV directory. It’s built for creators, photographers, and advanced users who want a clean, modern viewer with a focus on high performance on high end PC's Anaglyph 3D Mode , OpenCL‑based Visual Effects, Aggressive Optimisation, vcpkg + PowerShell One‑Click Build Automation, Highly Tuned Graphics Path , Many Other Cutting‑Edge Features — ongoing experimental work, performance improvements, and creator‑driven enhancements continue to push the viewer forward.

11 Reviews

Downloads: 9 This Week

Last Update: 6 hours ago
See Project
15

A2M — Audio to MIDI

A2M converts piano recordings into editable MIDI files on Windows.

A2M (Audio to MIDI) is an open-source Windows application for converting piano recordings into editable MIDI files. It estimates notes, timing, velocity, sustain, and soft-pedal events, with the best results obtained from clean solo piano recordings. Choose an audio file, select CPU or a CUDA/DirectML mode, and click Convert to MIDI. The generated file is saved to Downloads/A2M by default, or to a custom folder selected in Settings. Audio analysis is performed locally. Source recordings are not uploaded, and no account is required. Network access may be used for requested downloads and update checks when enabled. CPU processing is available by default. Optional GPU acceleration can reduce processing time on compatible hardware. Website: https://www.justagwas.com/projects/a2m GitHub: https://github.com/Justagwas/A2M Docs: https://github.com/Justagwas/A2M/wiki VirusTotal: https://www.virustotal.com/gui/file/898c575338ee010597d1224dc5da8f3db9f35191f5ed1c91752f52d55b1211c

Downloads: 32 This Week

Last Update: 2026-07-17
See Project
16

Inkscape Table Support

This Inkscape extension provides table support for Inkscape. This is one of the features that are present in Corel Draw but are not present in Inkscape. So, just install this extension and you will have table support in Inkscape

5 Reviews

Downloads: 7 This Week

Last Update: 2015-11-21
See Project
17

HPINKJET

The Hewlett-Packard Co. Linux Inkjet Driver Project has moved!

2 Reviews

Downloads: 10 This Week

Last Update: 2013-03-13
See Project
18

PyOpenGL

PyOpenGL is the binding layer between Python and OpenGL.

2 Reviews

Downloads: 10 This Week

Last Update: 2013-05-27
See Project
19

Gimp Plug-in for Image Registration

This is an image registration plug-in for Gimp. Image registration (also known as image alignment) is the process of transforming one image so that it best matches another image. This is especially useful when you want to combine two or more images that are similar but geometrically misaligned. Visit the project web page at https://gimp-image-reg.sourceforge.io for detailed information on installing and using the plug-in. Starting with version 3.0.0, this project is licensed under the MIT License. Earlier versions (≤ 2.0.1) remain under GPL-3.0.

3 Reviews

Downloads: 8 This Week

Last Update: 2025-11-20
See Project
20

Pitcher's Duel

Pitchers Duel is a Baseball simulation focusing on the one-on-one contest between a batter and a pitcher. Will include both against-the-computer and networked interplayer variations.

Downloads: 28 This Week

Last Update: 2013-03-14
See Project
21

xSTUDIO

xSTUDIO is a high performance playback and review tool.

xSTUDIO is a high performance playback and review tool designed by and for Visual Effects, Animation and Post Production professionals. The application can load and play large collections of media files. The efficient playback engine allows you to quickly load and play high resolution image formats with a wide range of file formats and encoding. Intuitive tools allow you to create and organise playlists and media sub-sets within playlists to build interactive review sessions, image and video reference libraries. A multi-track timeline editing interface provides the facility for loading or creating edits from simple to complex.

Downloads: 28 This Week

Last Update: 2026-06-21
See Project
22

Freevo - Media Centre Platform

Freevo is an open source HTPC media centre integrating PVR / DVR funtionality along with music, video, gaming, home automation and more. It is written in python and uses existing popular software such as mplayer, xine and vlc.

1 Review

Downloads: 15 This Week

Last Update: 2013-06-05
See Project
23

AI Upscaler for Blender

AI Upscaler for Blender using Real-ESRGAN

Blender add-on to dramatically reduce render times using the Real-ESRGAN upscaler. Rendering an HD image in Blender takes 37 minutes. Upscaling can render a similar quality image in 5 mins total. Any PC or laptop can now do 3D rendering. 4k images can be rendered in the time it would take to render HD 1080p images. HD 1080p images can be rendered in record time on low-end hardware. Installation is easy. Just install the addon. No special hardware or GPU is required. Upscaling is done entirely on the CPU. Blender renders a low-resolution image. The Real-ESRGAN Upscaler upscales the low-resolution image to a higher-resolution image. Real-ESRGAN is a deep learning upscaler that uses neural networks to achieve excellent results by adding in detail when it upscales.

Downloads: 1 This Week

Last Update: 2023-08-08
See Project
24

Asteroid

The PyTorch-based audio source separation toolkit for researchers

The PyTorch-based audio source separation toolkit for researchers. Pytorch-based audio source separation toolkit that enables fast experimentation on common datasets. It comes with a source code thats supports a large range of datasets and architectures, and a set of recipes to reproduce some important papers. Building blocks are thought and designed to be seamlessly plugged together. Filterbanks, encoders, maskers, decoders and losses are all common building blocks that can be combined in a flexible way to create new systems. Extending the toolkit with new features is simple. Add a new filterbank, separator architecture, dataset or even recipe very easily. Recipes provide an easy way to reproduce results with data preparation, system design, training and evaluation in a single script. This is an essential tool for the community! The default logger is TensorBoard in all the recipes. From the recipe folder, you can run the following to visualize the logs of all your runs.

Downloads: 1 This Week

Last Update: 2023-10-12
See Project
25

ChatterBot

Machine learning, conversational dialog engine for creating chat bots

ChatterBot is a Python library that makes it easy to generate automated responses to a user’s input. ChatterBot uses a selection of machine learning algorithms to produce different types of responses. This makes it easy for developers to create chat bots and automate conversations with users. For more details about the ideas and concepts behind ChatterBot see the process flow diagram. The language independent design of ChatterBot allows it to be trained to speak any language. Additionally, the machine-learning nature of ChatterBot allows an agent instance to improve it’s own knowledge of possible responses as it interacts with humans and other sources of informative data. An untrained instance of ChatterBot starts off with no knowledge of how to communicate. Each time a user enters a statement, the library saves the text that they entered and the text that the statement was in response to. As ChatterBot receives more input the number of responses that it can reply increase.

Downloads: 1 This Week

Last Update: 2026-06-19
See Project