Open Source Python Multimedia Software - Page 9

Python Multimedia Software

Browse free open source Python Multimedia Software and projects below.

  • 1
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can operate in real time and supports variable bandwidths, bitrates, and multi-band audio. Encodec has applications in speech and music compression, generative modeling, and efficient data transmission for communication systems. The repository includes pretrained checkpoints, PyTorch inference code, and examples for integrating Encodec as a module in downstream generative or streaming systems.
    Downloads: 1 This Week
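
    The "learned quantizer" mentioned above can be illustrated with a toy residual vector quantizer, the scheme the EnCodec paper describes. This is only a sketch under stated assumptions: the two tiny codebooks below are written by hand (in the real model they are learned), and the names `nearest`, `rvq_encode`, `rvq_decode`, and `CODEBOOKS` are hypothetical and not part of the EnCodec repository.

```python
from typing import List, Sequence

Vector = Sequence[float]


def nearest(codebook: Sequence[Vector], v: Vector) -> int:
    """Return the index of the codebook entry closest to v (squared distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - x) ** 2 for c, x in zip(codebook[i], v)),
    )


def rvq_encode(codebooks: Sequence[Sequence[Vector]], v: Vector) -> List[int]:
    """Quantize v stage by stage; each stage encodes what the previous one missed."""
    residual = list(v)
    indices = []
    for codebook in codebooks:
        i = nearest(codebook, residual)
        indices.append(i)
        residual = [r - c for r, c in zip(residual, codebook[i])]
    return indices


def rvq_decode(codebooks: Sequence[Sequence[Vector]], indices: Sequence[int]) -> List[float]:
    """Reconstruct the vector by summing the selected entry of every stage."""
    out = [0.0] * len(codebooks[0][0])
    for codebook, i in zip(codebooks, indices):
        out = [o + c for o, c in zip(out, codebook[i])]
    return out


# Hand-written toy codebooks (assumption: real codebooks are learned and much larger).
CODEBOOKS = [
    [(0.0, 0.0), (1.0, 1.0), (-1.0, 1.0)],    # coarse stage
    [(0.0, 0.0), (0.25, 0.0), (0.0, -0.25)],  # fine stage refines the residual
]

codes = rvq_encode(CODEBOOKS, (1.2, 0.9))
approx = rvq_decode(CODEBOOKS, codes)
```

    Using fewer stages at decode time gives a coarser reconstruction from fewer transmitted indices, which is one way a single quantizer of this kind can serve several bitrates.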
  • 2
    GIF for CLI

    GIF for CLI

    Takes in a GIF, short video, or a query to the Tenor GIF API

    gif-for-cli is a small, playful utility that brings animated GIFs to the command line by rendering frames directly in a terminal. It takes an input GIF (or a URL) and converts each frame into a terminal-friendly representation, timing updates to approximate the original animation. Depending on terminal capabilities, it can use ANSI color blocks or image protocols to achieve surprisingly faithful playback. The tool includes conveniences such as looping control, scaling to fit your terminal, and caching to avoid repeated downloads. It’s useful for quick visual feedback, lightweight demos, or just adding some personality to scripts and readmes. Despite its whimsical nature, the project is a tight example of frame decoding, timing, and terminal rendering glued together in a single command.
    Downloads: 1 This Week
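
    The ANSI color block approach described above might look roughly like the following. It is a minimal sketch, not gif-for-cli's own code: it assumes frames are already decoded into rows of RGB tuples and that the terminal understands 24-bit color escape sequences. The names `render_frame` and `play` are hypothetical.

```python
import sys
import time
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Frame = List[List[Pixel]]

RESET = "\x1b[0m"        # restore default colors
CURSOR_HOME = "\x1b[H"   # move the cursor to the top-left corner


def render_frame(pixels: Frame) -> str:
    """Render each pixel as one space with a 24-bit ANSI background color."""
    lines = []
    for row in pixels:
        cells = "".join(f"\x1b[48;2;{r};{g};{b}m " for r, g, b in row)
        lines.append(cells + RESET)
    return "\n".join(lines)


def play(frames: List[Frame], delay_s: float, loops: int = 1) -> None:
    """Redraw frames in place, sleeping between them to approximate the animation."""
    for _ in range(loops):
        for frame in frames:
            sys.stdout.write(CURSOR_HOME + render_frame(frame) + "\n")
            sys.stdout.flush()
            time.sleep(delay_s)
```

    Calling `play(frames, delay_s=0.1, loops=3)` would redraw the frames three times at roughly ten frames per second; scaling a frame to the terminal size would happen before rendering.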
  • 3
    Image Downloader

    Image Downloader

    Download images from Google, Bing, Baidu

    Crawl and download images using Selenium, Python 3, and PyQt5. Supported search engines: Google, Bing, and Baidu. Keywords can be typed in directly or read from a file with one keyword per line for batch processing. Images are downloaded using a customizable number of threads. Conditional search operators (e.g. filetype:, site:) are fully supported. Other features include a switch for Google safe mode and proxy configuration (SOCKS, HTTP). Both command-line and GUI front ends are provided.
    Downloads: 1 This Week
  • 4
    Photonix Photo Manager

    Photonix Photo Manager

    A modern, web-based photo management server

    A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms. This project is currently in development and is not yet feature complete for a version 1.0. If you don't mind putting up with broken parts or want to help out, run the Docker image and give it a go. I'd love for other contributors to get involved. You can move some photos into the folder data/photos and they should get detected and imported immediately. Once you have finished trying out the system, you can edit the volume in the docker-compose.yml file where it says ./data/photos to mount wherever you usually keep photos. The system database, thumbnails and other cache data are stored separately from the photos, so they shouldn't pollute that area. You are responsible for keeping your own backups in case of error.
    Downloads: 1 This Week
  • 5
    Pydub

    Pydub

    Manipulate audio with a simple and easy high level interface

    Manipulate audio with a simple and easy high-level interface. You can pass an optional bitrate argument to export using any syntax ffmpeg supports. Any further arguments supported by ffmpeg can be passed as a list in a 'parameters' argument, with switch first, argument second. Note that no validation takes place on these parameters, and you may be limited by what your particular build of ffmpeg/libav supports. You can open and save WAV files with pure Python. For opening and saving non-WAV files, like MP3, you'll need ffmpeg or libav. Any operations that combine multiple AudioSegment objects in any way will first ensure that they have the same number of channels, frame rate, sample rate, bit depth, etc.
    Downloads: 1 This Week
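
    To make the "switch first, argument second" convention concrete, here is a sketch of how an optional bitrate and a pass-through parameter list could be assembled into an ffmpeg command line. This is an illustration only and not Pydub's internal code: `build_export_args` is a hypothetical helper, and the file names are placeholders. The flags themselves (`-y`, `-i`, `-f`, `-b:a`, `-ar`) are standard ffmpeg options.

```python
from typing import List, Optional, Sequence


def build_export_args(
    src: str,
    dst: str,
    fmt: str,
    bitrate: Optional[str] = None,
    parameters: Optional[Sequence[str]] = None,
) -> List[str]:
    """Assemble an ffmpeg argument list (hypothetical helper, for illustration)."""
    args = ["ffmpeg", "-y", "-i", src, "-f", fmt]
    if bitrate is not None:
        # Any bitrate syntax ffmpeg accepts, e.g. "192k".
        args += ["-b:a", bitrate]
    if parameters:
        # Passed through as-is: switch first, argument second, no validation.
        args += list(parameters)
    args.append(dst)
    return args


cmd = build_export_args(
    "in.wav", "out.mp3", "mp3", bitrate="192k", parameters=["-ar", "44100"]
)
```

    Because the extra parameters are not validated, a typo in a switch would only surface when ffmpeg itself rejects the command.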
  • 6
    Stable Diffusion Rembg

    Stable Diffusion Rembg

    Removes backgrounds from pictures. Extension for webui

    This project is an extension for the Stable Diffusion Web UI that removes backgrounds from images directly inside the interface. It wraps popular background-removal models so creators can take a generated or uploaded image and isolate the subject with a single click. The workflow is designed to be non-destructive: you can preview, tweak thresholds, and export either a transparent PNG or a masked layer for further editing. Because it runs within the Web UI, you can chain it with other operations such as upscaling, inpainting, or ControlNet to refine edges and composites. Batch processing helps clear backgrounds from whole sets of renders, which is useful for asset pipelines, catalogs, and thumbnails. The extension aims for convenience and predictable results, sparing users from round-tripping through separate editors just to knock out a background.
    Downloads: 1 This Week
  • 7
    Super Tiny Icons

    Super Tiny Icons

    Super Tiny Icons are minuscule SVG versions of your favourite website

    SuperTinyIcons is a collection of brand and service logos distilled into extremely small, hand-tuned SVGs, often targeting sub-kilobyte file sizes. Each icon is crafted to preserve recognizable shapes with the fewest possible paths and nodes, trading photorealism for clarity at common UI sizes. The project emphasizes performance: tiny inline SVGs reduce network transfer, speed up rendering, and scale crisply on high-DPI displays. Designers and developers can embed the icons directly, recolor them via CSS, or combine them in sprites without raster assets. The repository maintains consistent viewboxes and alignment so icons sit neatly alongside text and other UI elements. It is especially useful for landing pages, status banners, and mobile experiences where every byte matters.
    Downloads: 1 This Week
  • 8
    Surface Defect Detection Dataset Papers

    Surface Defect Detection Dataset Papers

    Constantly summarizing open source datasets and critical papers

    Surface defect inspection equipment based on machine vision has widely replaced manual visual inspection across industry, including 3C, automobiles, home appliances, machinery manufacturing, semiconductors and electronics, chemicals, pharmaceuticals, aerospace, and light industry. Traditional machine-vision methods for surface defect detection often use conventional image processing algorithms or hand-crafted features plus classifiers. Imaging schemes are usually designed around the distinct properties of the inspected surface or its defects. A well-chosen imaging scheme helps to obtain images with uniform illumination that clearly show the surface defects of the object. In recent years, many defect detection methods based on deep learning have also been widely used in industrial scenarios.
    Downloads: 1 This Week
  • 9
    VGGSfM

    VGGSfM

    VGGSfM: Visual Geometry Grounded Deep Structure From Motion

    VGGSfM is an advanced structure-from-motion (SfM) framework jointly developed by Meta AI Research (GenAI) and the University of Oxford’s Visual Geometry Group (VGG). It reconstructs 3D geometry, dense depth, and camera poses directly from unordered or sequential images and videos. The system combines learned feature matching and geometric optimization to generate high-quality camera calibrations, sparse/dense point clouds, and depth maps in standard COLMAP format. Version 2.0 adds support for dynamic scene handling, dense point cloud export, video-based reconstruction (1000+ frames), and integration with Gaussian Splatting pipelines. It leverages tools like PyCOLMAP, poselib, LightGlue, and PyTorch3D for feature matching, pose estimation, and visualization. With minimal configuration, users can process single scenes or full video sequences, apply motion masks to exclude moving objects, and train neural radiance or splatting models directly from reconstructed outputs.
    Downloads: 1 This Week
  • 10
    VSGAN

    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    Single Image Super-Resolution Generative Adversarial Network (GAN) which uses the VapourSynth processing framework to handle input and output image data. Transform, Filter, or Enhance your input video, or the VSGAN result with VapourSynth, a Script-based NLE. You can chain models or re-run the model twice-over (or more). Have low VRAM? Don’t worry! The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. VSGAN is released under the MIT License, ensuring it will stay free, with the ability to be used commercially.
    Downloads: 1 This Week
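
    The quadrant strategy mentioned above can be sketched in plain Python: split the image into four tiles, run the model on each tile separately, and stitch the results back together. This is a simplified illustration, not VSGAN's implementation. The image is a nested list, `fn` stands in for the network, and `apply_in_quadrants` and `upscale_2x` are hypothetical names. The sketch assumes the image is at least 2x2 and that `fn` scales every tile by the same factor.

```python
from typing import Callable, List

Image = List[List[float]]


def apply_in_quadrants(img: Image, fn: Callable[[Image], Image]) -> Image:
    """Run fn on four quadrants independently and reassemble the outputs."""
    height, width = len(img), len(img[0])
    mid_y, mid_x = height // 2, width // 2

    def crop(y0: int, y1: int, x0: int, x1: int) -> Image:
        return [row[x0:x1] for row in img[y0:y1]]

    # Only one quadrant has to be resident in (GPU) memory at a time.
    top_left = fn(crop(0, mid_y, 0, mid_x))
    top_right = fn(crop(0, mid_y, mid_x, width))
    bottom_left = fn(crop(mid_y, height, 0, mid_x))
    bottom_right = fn(crop(mid_y, height, mid_x, width))

    top = [left + right for left, right in zip(top_left, top_right)]
    bottom = [left + right for left, right in zip(bottom_left, bottom_right)]
    return top + bottom


def upscale_2x(tile: Image) -> Image:
    """Stand-in for a super-resolution model: nearest-neighbor 2x upscaling."""
    out = []
    for row in tile:
        wide = [p for p in row for _ in range(2)]
        out.extend([wide, list(wide)])
    return out


result = apply_in_quadrants([[1, 2], [3, 4]], upscale_2x)
```

    A real network sees less context near tile borders than it would on the full frame, so seams can appear; tiled pipelines commonly overlap the tiles to hide this, which the sketch leaves out.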
  • 11
    You-Get

    You-Get

    Dumb downloader that scrapes the web

    You-Get is a small command-line utility for downloading media (video, audio and images) from the Web when there are no other means to do so. It can download video and audio files from such popular web sites as YouTube, Twitter, Niconico, Vimeo, Flickr, Instagram and a whole lot more. You-Get is a great option for when you want to enjoy your favorite videos, audio or images from the internet without having to open any web browsers or get interrupted by ads. It’s also a good choice when you want to keep closed-source technology or proprietary JavaScript code from running on your computer.
    Downloads: 1 This Week
  • 12
    YouTube-8M

    YouTube-8M

    Starter code for working with the YouTube-8M dataset

    youtube-8m is Google’s open source starter code and reference implementation for training and evaluating machine learning models on the YouTube-8M dataset, one of the largest video understanding datasets publicly released. The repository provides a complete pipeline for video-level and frame-level modeling using TensorFlow, including data reading, model training, evaluation, and inference. It was developed to support the YouTube-8M Video Understanding Challenge (hosted on Kaggle and featured at ICCV 2019), enabling researchers and practitioners to benchmark video classification models on large-scale datasets with millions of labeled videos. The code demonstrates how to process frame-level features, train logistic and deep learning models, evaluate them using metrics like global Average Precision (gAP) and mean Average Precision (mAP), and export trained models for MediaPipe inference.
    Downloads: 1 This Week
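
    As a rough illustration of the metrics named above, the sketch below computes average precision for one ranked list and the mean over several classes. It is a textbook-style version, not the repository's evaluation code: `average_precision` and `mean_average_precision` are hypothetical names, and challenge-specific details (such as how predictions are pooled and truncated for the global variant) are deliberately not reproduced.

```python
from typing import List, Sequence, Tuple

Scored = Sequence[Tuple[float, bool]]  # (confidence, is_correct_label)


def average_precision(scored: Scored) -> float:
    """Mean of the precision values measured at each correctly ranked positive."""
    ranked = sorted(scored, key=lambda item: item[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, is_positive) in enumerate(ranked, start=1):
        if is_positive:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(per_class: List[Scored]) -> float:
    """Average the per-class AP values (mAP)."""
    return sum(average_precision(s) for s in per_class) / len(per_class)


class_a = [(0.9, True), (0.8, False), (0.7, True)]
class_b = [(0.5, False), (0.4, True)]

ap_a = average_precision(class_a)
map_value = mean_average_precision([class_a, class_b])
# A "global" AP would instead run average_precision over predictions pooled
# across classes, e.g. average_precision(list(class_a) + list(class_b)).
```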
  • 13
    asciinema

    asciinema

    Open source terminal session recorder

    asciinema is a free and open source terminal session recorder. It lets you easily record and play back terminal sessions in the terminal or in a web browser. Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based, you can copy and paste any content you want: simply pause the recording! You can also easily share your recordings on the web, embed an asciicast player in your blog post, project documentation page or in your conference talk slides. See plenty of example sessions recorded with asciinema here: https://asciinema.org/
    Downloads: 1 This Week
  • 14
    pybaselines

    pybaselines

    Library of algorithms for baseline correction of experimental data

    pybaselines is a Python library that provides many different algorithms for performing baseline correction on data from experimental techniques such as Raman, FTIR, NMR, XRD, XRF, PIXE, etc. The aim of the project is to provide a semi-unified API to allow quick testing and comparing multiple baseline correction algorithms to find the best one for a set of data. pybaselines has 50+ baseline correction algorithms. These include popular algorithms, such as AsLS, airPLS, ModPoly, and SNIP, as well as many lesser-known algorithms. Most algorithms are adapted directly from literature, although there are a few that are unique to pybaselines, such as penalized spline versions of Whittaker-smoothing-based algorithms. The full list of implemented algorithms can be found in the documentation.
    Downloads: 1 This Week
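
    To give a feel for what one of the named algorithms does, here is a deliberately simplified, pure-Python sketch of the core SNIP clipping loop. It is not pybaselines' implementation and does not use its API: the function name `snip_baseline` is hypothetical, and common refinements of the method (such as transforming the data before clipping) are omitted.

```python
from typing import List, Sequence


def snip_baseline(y: Sequence[float], max_half_window: int) -> List[float]:
    """Estimate a baseline by repeatedly clipping points above their neighbors' mean."""
    baseline = list(y)
    n = len(baseline)
    for half_window in range(1, max_half_window + 1):
        previous = baseline[:]  # clip against the previous pass, not partial updates
        for i in range(half_window, n - half_window):
            neighbor_mean = (previous[i - half_window] + previous[i + half_window]) / 2
            baseline[i] = min(previous[i], neighbor_mean)
    return baseline


# A straight-line background (0, 1, 2, ...) with a single sharp peak at index 4.
signal = [0, 1, 2, 3, 14, 5, 6, 7, 8]
baseline = snip_baseline(signal, max_half_window=2)
corrected = [s - b for s, b in zip(signal, baseline)]
```

    In this toy case the peak is clipped back onto the line, so subtracting the estimated baseline leaves only the peak height. The half-window should be at least as wide as the broadest peak, otherwise part of the peak is absorbed into the baseline.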
  • 15
    termtosvg

    termtosvg

    Record terminal sessions as SVG animations

    termtosvg is a Unix terminal recorder written in Python that renders your command line sessions as standalone SVG animations. Produce lightweight and clean-looking animations embeddable on a project page. Custom color themes, terminal UI, and animation controls via SVG templates. Compatible with asciinema recording format. termtosvg is compatible with Linux, macOS and BSD OSes, requires Python >= 3.5 and can be installed using pip.
    Downloads: 1 This Week
  • 16
    youtube-dlG

    youtube-dlG

    A cross platform front-end GUI of the popular youtube-dl

    A cross platform front-end GUI of the popular youtube-dl media downloader written in wxPython. Youtube-dl-gui uses youtube-dl in the backend to download files. Youtube-dl provides a list of extractors to work with each particular site. You can change the naming pattern by picking a different filename format under the Options>General tab. You can also use a custom pattern by setting the option to "Custom" and editing the output template field. You can change the number of max parallel downloads by editing the "workers_number" option in your settings.json file. You can add extra youtube-dl command line options in the commands box under the Options>Extra tab. You can use your own version of youtube-dl by editing the "youtubedl_path" option in your settings.json file and make it point to your own binary (e.g. /usr/local/bin).
    Downloads: 1 This Week
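
    The two settings.json edits described above could be scripted as follows. This is a sketch under assumptions: the option names "workers_number" and "youtubedl_path" come from the description, but the location of settings.json, the value type of "workers_number" (an integer is assumed here), and the helper name `update_settings` are not confirmed by it.

```python
import json
from pathlib import Path
from typing import Optional


def update_settings(
    path: str,
    workers_number: Optional[int] = None,
    youtubedl_path: Optional[str] = None,
) -> dict:
    """Rewrite selected options in a settings.json file, keeping all other keys."""
    settings_file = Path(path)
    settings = json.loads(settings_file.read_text(encoding="utf-8"))
    if workers_number is not None:
        # Maximum number of parallel downloads (assumed to be stored as an integer).
        settings["workers_number"] = workers_number
    if youtubedl_path is not None:
        # Where to find your own youtube-dl binary, as in the example above.
        settings["youtubedl_path"] = youtubedl_path
    settings_file.write_text(json.dumps(settings, indent=4), encoding="utf-8")
    return settings
```

    For example, `update_settings("settings.json", workers_number=5)` would raise the parallel download limit while leaving every other option untouched. Editing the file while the application is closed avoids having the change overwritten on exit.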
  • 17
    The Hewlett-Packard Co. Linux Inkjet Driver Project has moved!
    Downloads: 9 This Week
  • 18
    vido

    vido

    Video/Audio Downloader frontend for youtube-dl

    Vido is a Video/Audio Downloader frontend for the popular YouTube downloader youtube-dl, a rewrite of ytd-gtk by the same team, updated to Python 3 using PyGObject and GTK+. Vido now uses yt-dlp, an updated fork of youtube-dl, for downloading videos. yt-dlp is faster and supports more sites than the original youtube-dl. The program has been tested only on Linux, and installation instructions for Linux are provided on our wiki. We do not provide support for Windows/Mac; however, anyone who wishes to port, compile, or distribute the program for Windows/Mac is most welcome. Refer to our wiki for details.
    Downloads: 9 This Week
  • 19
    PixelToPath

    PixelToPath

    Convert PNG to SVG with a simple GUI tool.

    PixelToPath is an open-source application that converts PNG images into scalable vector graphics (SVG) using the Potrace engine. Designed with simplicity in mind, it provides an intuitive graphical interface to adjust vectorization settings such as smoothing, threshold, and curve precision. PixelToPath is available as a standalone executable for Windows (no Python or installation required) and as a source version for Linux and Windows users who prefer customization. Potrace is fully integrated, allowing offline usage with no extra configuration. Whether you're a designer, developer, or hobbyist, PixelToPath makes bitmap-to-vector conversion fast, accessible, and efficient. The project is hosted on GitHub with source code and releases available for download.
    Downloads: 14 This Week
  • 20
    iPodder is a media aggregator which automatically downloads files to your computer or portable device, leaving you 'one-click-away' from latest media feeds. Based on the iPodder idea of Adam Curry.
    Downloads: 8 This Week
  • 21
    DR14 T.meter

    DR14 T.meter

    Compute the DR14 of a given audio file according to the procedure desc

    DR14 T.meter is a free and open source command line tool for computing the Dynamic Range of your music according to the procedure used in the off-line meter released by the Pleasurize Music Foundation. The tool is useful for measuring how loud your music is, for understanding that a good-quality album also has good dynamics, and for understanding the effects of the so-called loudness war. DR14 T.meter is released under the terms of the GNU GPLv3 license and is intended for Linux users who cannot use the Windows application released by the Pleasurize Music Foundation or the plugin for Foobar2000. I have tested this application extensively, and it has always returned results identical to those of the official off-line meter.
    Downloads: 12 This Week
  • 22
    Intention Repeater MAX

    Intention Repeater MAX

    Repeating your Intentions to aid in manifestation

    Please see the README.txt. The ServitorConnect 4443 and Python Daemon and Intention Repeater Android are better because repeating once-per-hour is better than millions of times per second (or even 3Hz). The archive bundle includes binaries and source code for: MAX and Simple Intention Repeaters, CUDA version for Windows/Linux, Memory Frequency Generator, Multi-Format to WAV Repeater, Android app Sourcecode, File/Image Writers, Nesting Files Creator, Prayer Wheel, Spiritual Chat, Tarot, WiFi Broadcast. Whether you're seeking to manifest abundance, enhance your spiritual journey, or promote overall well-being, Intention Repeater MAX empowers you to harness the incredible strength of intention repetition. Compatible with Windows, macOS, and Linux. Download now and experience the life-changing benefits for yourself! 🙌
    Downloads: 12 This Week
  • 23
    PiStudio

    PiStudio

    A GIMP based, studio designed for industry pros & power users

    PiStudio is heavily based on GIMP and is portable. It is designed for industry professionals and advanced users. With the classic GIMP tools and new brushes added monthly, it is sure to suit all your graphic design needs. It has full support for a large array of graphics tablets and other advanced peripherals. It is set up to enhance productivity and ease of use. The PiFIRE logo is (C) Dangerous Pi 2016. PiNet is (C) Dangerous Pi 2017. All rights reserved. The software is GNU open source.
    Downloads: 12 This Week
  • 24
    UniConvertor

    UniConvertor

    Universal graphics translator

    UniConvertor is a universal graphics translator. The project uses the sK1 engine to convert one format to another. It has import filters for SVG, CDR, CDT, CMX, AI, XAR, CGM, WMF, XFIG, SK, SK1, SK2, CPL, ASE, ACO, JCW, GPL, SOC, SKP, PSD, XCF, PNG, JPG, TIFF, WEBP, BMP, PCX, PPM, XBM, XPM, and export filters for SVG, AI, CDR, CMX, PDF, SK, SK1, SK2, CGM, WMF, CPL, ASE, ACO, JCW, GPL, SOC, SKP, PNG. This SourceForge project page is outdated. To download the latest UniConvertor binaries, please visit the official project site: https://sk1project.net/uc2/
    Downloads: 9 This Week
  • 25
    Warlock-Studio

    Warlock-Studio

    AI Suite for upscaling, interpolating & restoring images/videos

    v6.0. Warlock-Studio is a Windows application that uses the Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE AI models to upscale, restore faces, interpolate frames and reduce noise in images and videos. The application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system designed to overcome GPU VRAM limitations.
    Downloads: 21 This Week