Open Source Python Multimedia Software - Page 7

Python Multimedia Software

View 4896 business solutions

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    PyVista

    PyVista

    3D plotting and mesh analysis through a streamlined interface

    3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK). PyVista is a helper module for the Visualization Toolkit (VTK) that takes a different approach on interfacing with VTK through NumPy and direct array access. This package provides a Pythonic, well-documented interface exposing VTK’s powerful visualization backend to facilitate rapid prototyping, analysis, and visual integration of spatially referenced datasets. This module can be used for scientific plotting for presentations and research papers as well as a supporting module for other mesh-dependent Python modules. Easily integrate with NumPy and create a variety of geometries and plot them. You could use any geometry to create your glyphs, or even plot the points directly. Direct access to mesh analysis and transformation routines. Intuitive plotting routines with matplotlib similar syntax.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including Soulseek, YouTube, and Beatport, then verify file accuracy through AcoustID fingerprinting to ensure the correct audio was obtained. The platform also enriches tracks with lyrics, album art, and MusicBrainz metadata while organizing files into clean, customizable folder structures. Once processed, the library can be synchronized with media servers like Plex, Jellyfin, or Navidrome so collections stay up to date without manual intervention.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Spotify Music Downloader

    Spotify Music Downloader

    Spotify Music Downloader

    Download music from Spotify and other music sources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Tauon

    Tauon

    The music player of today

    Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, Listenbrainz, and Maloja scribbling. MPRIS2 support for desktop integration. Tauon is a playlist-oriented music player that makes the assumption that folders are albums. (You may find the words "album" and "folder" used interchangeably within the interface and documentation) While some features and interface elements may not be immediately intuitive, I hope that once you are familiar, you will find that Tauon is pleasant, fast, and fun to use.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Transparent Background

    Transparent Background

    This is a background removing tool powered by InSPyReNet

    This is a background-removing tool powered by InSPyReNet (ACCV 2022). You can easily remove the background from the image or video or bunch of other stuffs when you can make the background transparent! We basically follow the virtual camera settings from pyvirtualcam. If you do not choose to install virtual camera, it will visualize real-time output with cv2.imshow. Use another checkpoint file. Default is trained with composite dataset and will be automatically downloaded if not available.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    VGGSfM

    VGGSfM

    VGGSfM: Visual Geometry Grounded Deep Structure From Motion

    VGGSfM is an advanced structure-from-motion (SfM) framework jointly developed by Meta AI Research (GenAI) and the University of Oxford’s Visual Geometry Group (VGG). It reconstructs 3D geometry, dense depth, and camera poses directly from unordered or sequential images and videos. The system combines learned feature matching and geometric optimization to generate high-quality camera calibrations, sparse/dense point clouds, and depth maps in standard COLMAP format. Version 2.0 adds support for dynamic scene handling, dense point cloud export, video-based reconstruction (1000+ frames), and integration with Gaussian Splatting pipelines. It leverages tools like PyCOLMAP, poselib, LightGlue, and PyTorch3D for feature matching, pose estimation, and visualization. With minimal configuration, users can process single scenes or full video sequences, apply motion masks to exclude moving objects, and train neural radiance or splatting models directly from reconstructed outputs.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Videomass

    Videomass

    Videomass is a free, open source and cross-platform GUI for FFmpeg

    Videomass is a free, open-source graphical interface for FFmpeg designed to make advanced video and audio processing accessible to both beginners and experienced users. Built in Python using wxPython, it provides a cross-platform environment for managing encoding, conversion, and editing tasks through a visual interface. The software supports multitasking operations, allowing users to process multiple media files simultaneously. It offers extensive configuration options while also providing presets to simplify common workflows. Videomass integrates closely with FFmpeg, exposing powerful capabilities such as transcoding, filtering, and format conversion without requiring command-line interaction. It also supports scripting and customization for more advanced use cases. Overall, it combines usability and flexibility into a comprehensive multimedia processing tool.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    auto-subtitle

    auto-subtitle

    Automatically generate and overlay subtitles for any video

    auto-subtitle is a Python-based command-line tool that automatically generates and overlays subtitles on video files using AI-driven speech recognition. It combines FFmpeg with OpenAI’s Whisper model to transcribe spoken audio into text and synchronize it with video playback. The tool processes video input, extracts audio, and produces subtitle files that can be either exported separately or burned directly into the final video output. It supports multiple transcription models with varying accuracy and performance, allowing users to balance speed and quality depending on their needs. The system can also translate subtitles into English, enabling multilingual accessibility for video content. Once the required models are downloaded, it can operate offline, making it practical for local workflows. Designed for simplicity, it provides a streamlined way to automate subtitle creation without manual transcription effort.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    ffmpeg-normalize

    ffmpeg-normalize

    Audio Normalization for Python/ffmpeg

    ffmpeg-normalize is a command-line utility designed to normalize audio levels in media files using FFmpeg, ensuring consistent volume across multiple tracks. It supports both EBU R128 loudness normalization and peak normalization methods, allowing users to choose the appropriate standard for their needs. The tool analyzes audio streams and applies adjustments to achieve target loudness levels without introducing distortion. It can process multiple files in batch mode, making it suitable for large media libraries or production workflows. ffmpeg-normalize also preserves metadata and supports a wide range of input and output formats. Its design emphasizes accuracy and compliance with broadcasting standards. Overall, it provides a reliable solution for achieving consistent audio quality in multimedia content.
    Downloads: 3 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    PyOpenGL is the binding layer between Python and OpenGL.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 11
    GIF-Overlay

    GIF-Overlay

    Pure Floating GIF & Image Viewer

    Visit Website: https://duyxyz.github.io/GIF-Overlay/ GIF Overlay is a minimalist and high-performance Windows application designed for viewing GIFs and images with a unique floating experience. Whether you need a reference image for drawing, a desktop decoration, or a lightweight GIF player, GIF Overlay provides a borderless, premium interface that stays exactly where you need it.
    Downloads: 69 This Week
    Last Update:
    See Project
  • 12
    MediaCrate — Video/Audio Downloader

    MediaCrate — Video/Audio Downloader

    Download video and audio from over 1,000+ websites with one click

    MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It's rather simple to use. Paste a link, select format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads. Project links: Website: justagwas.com/projects/mediacrate GitHub: github.com/Justagwas/mediacrate Documentation: github.com/Justagwas/mediacrate/wiki The application is fully open source, runs entirely on your device, and only downloads content you explicitly request. VirusTotal scan result: https://www.virustotal.com/gui/file/5a9b70c40e0ceb8e8ac411252b56dabd678964dc8d3f804f1df438104e7499f6
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 13
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder. All processing is done locally on your device, no uploads, no accounts, and no telemetry. The app runs on CPU by default, with optional NVIDIA GPU acceleration for faster conversions. Project links: Website: justagwas.com/projects/a2m GitHub: github.com/Justagwas/a2m Documentation: https://github.com/Justagwas/a2m/wiki A2M is fully open source and operates only on the files you choose. VirusTotal scan result: https://www.virustotal.com/gui/file/cc2a961baaaac2f8932c2e9ed04f0c27a55309cc03ed0825e44c8af18e263ce6
    Leader badge
    Downloads: 68 This Week
    Last Update:
    See Project
  • 14
    Comix is a user-friendly, customizable image viewer. It is specifically designed to handle comic books, but also serves as a generic viewer. It reads images in ZIP, RAR or tar archives (also gzip or bzip2 compressed) as well as plain image files.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 15
    PyMOL Molecular Graphics System

    PyMOL Molecular Graphics System

    PyMOL is an OpenGL based molecular visualization system

    The Open-Source PyMOL repository has been moved to github: https://github.com/schrodinger/pymol-open-source We still use the pymol-users mailing list here on sourceforge. Please subscribe for community support: https://pymol.org/maillist (Note: SourceForge email newsletter and special offers are optional and can be unchecked) The PyMOL community wiki has its own home: https://pymolwiki.org/
    Downloads: 61 This Week
    Last Update:
    See Project
  • 16
    itom

    itom

    itom - an Open Source Measurement, Automation and Evaluation Software

    itom is an open source software suite for operating measurement systems, laboratory automation and data evaluation. One main application of itom is the development and operation of sensor and measurement system for instance in a laboratory environment. Therefore, the software has to be able to communicate with a wide range of different hardware systems, such as cameras or actuators and should provide a diversified and as complete as possible set of evaluation and data processing methods. Additionally, the rapid prototyping of modern measurement and inspection setups requires a system, where parameters or components can easily be changed at runtime, necessitating the availability of an embedded scripting language. Finally, when operating a measurement system, it is also desirable to extend the graphical user interface by system adapted dialogs and windows. The project has been moved mid 2023 to github: https://itom-project.github.io https://github.com/itom-project
    Downloads: 33 This Week
    Last Update:
    See Project
  • 17
    3D Gaussian Splatting

    3D Gaussian Splatting

    Original reference implementation of "3D Gaussian Splatting"

    Gaussian Splatting is the official implementation of “3D Gaussian Splatting for Real-Time Radiance Field Rendering,” a research project for reconstructing and rendering 3D scenes from collections of images. The system represents scenes as millions of optimized 3D Gaussians rather than traditional meshes or neural fields, allowing high-quality novel view synthesis with real-time rendering performance. It includes training scripts, rendering tools, scene conversion utilities, and viewers for inspecting generated results. The project is widely used in computer graphics, spatial capture, virtual production, research, and experimental 3D reconstruction workflows. It relies on image-based reconstruction pipelines such as COLMAP to estimate camera positions before optimizing the Gaussian representation. Overall, Gaussian Splatting has become a foundational reference implementation for modern real-time radiance field rendering.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    AI Upscaler for Blender

    AI Upscaler for Blender

    AI Upscaler for Blender using Real-ESRGAN

    Blender add-on to dramatically reduce render times using the Real-ESRGAN upscaler. Rendering an HD image in Blender takes 37 minutes. Upscaling can render a similar quality image in 5 mins total. Any PC or laptop can now do 3D rendering. 4k images can be rendered in the time it would take to render HD 1080p images. HD 1080p images can be rendered in record time on low-end hardware. Installation is easy. Just install the addon. No special hardware or GPU is required. Upscaling is done entirely on the CPU. Blender renders a low-resolution image. The Real-ESRGAN Upscaler upscales the low-resolution image to a higher-resolution image. Real-ESRGAN is a deep learning upscaler that uses neural networks to achieve excellent results by adding in detail when it upscales.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    FrankMocap

    FrankMocap

    A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

    FrankMocap is a monocular 3D human capture system that estimates body, hand, and optionally face pose from a single RGB image or video. It regresses parametric human models (e.g., SMPL/SMPL-X) directly, producing temporally stable meshes and joint angles suitable for animation or analytics. The pipeline couples a robust 2D keypoint detector with 3D mesh regression networks and priors that keep results anatomically plausible. It can run frame-by-frame or with temporal smoothing, and includes demo apps for live webcam capture as well as batch processing. Outputs include textured meshes, joint locations, and model parameters that can be exported to common DCC tools and game engines. The codebase offers pretrained models, clear inference scripts, and utilities to visualize results, making single-camera motion capture approachable on commodity hardware. Researchers and creators use it for motion studies, AR/VR prototyping, character animation, and human-in-the-loop editing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    MystiQ

    MystiQ

    Qt5/C++ FFmpeg Media Converter

    MystiQ is a cross-platform multimedia converter built with Qt and FFmpeg, designed to provide a modern graphical interface for video and audio processing tasks. It allows users to perform operations such as transcoding, trimming, and format conversion without needing to use command-line tools. The application supports a wide range of codecs and formats, enabling compatibility across devices and platforms. It includes batch processing capabilities, allowing multiple files to be converted simultaneously. MystiQ also provides customizable encoding parameters, giving users control over quality and performance. Its interface is designed to be intuitive while still exposing advanced features for experienced users. Overall, it combines ease of use with powerful multimedia processing capabilities.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    PersonaLive

    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    PersonaLive is an open-source diffusion-based portrait animation framework focused on generating expressive, long-duration animated sequences in real time, primarily for live streaming or interactive applications. It leverages deep generative models that condition on a static reference image and a driving input (such as motion or expression cues) to produce a seamless animated portrait sequence that can run indefinitely without segmentation artifacts. The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. PersonaLive’s architecture balances visual quality and efficiency by combining motion encoding, temporal modules, and hybrid implicit control signals to preserve identity and stable expression through long sequences.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Pydub

    Pydub

    Manipulate audio with a simple and easy high level interface

    Manipulate audio with a simple and easy high level interface. You can pass an optional bitrate argument to export using any syntax ffmpeg supports. Any further arguments supported by ffmpeg can be passed as a list in a 'parameters' argument, with switch first, argument second. Note that no validation takes place on these parameters, and you may be limited by what your particular build of ffmpeg/avlib supports. You can open and save WAV files with pure python. For opening and saving non-wav files, like mp3, you'll need ffmpeg or libav. Any operations that combine multiple AudioSegment objects in any way will first ensure that they have the same number of channels, frame rate, sample rate, bit depth, etc.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    plotly.py

    plotly.py

    The interactive graphing library for Python

    plotly.py is a browser-based, open source graphing library for Python that lets you create beautiful, interactive, publication-quality graphs. Built on top of plotly.js, it is a high-level, declarative charting library that ships with more than 30 chart types. Everything from statistical charts and scientific charts, through to maps, 3D graphs and animations, plotly.py lets you create them all. Graphs made with plotly.py can be viewed in Jupyter notebooks, standalone HTML files, or hosted online using Chart Studio Cloud.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    pybaselines

    pybaselines

    Library of algorithms for baseline correction of experimental data

    pybaselines is a Python library that provides many different algorithms for performing baseline correction on data from experimental techniques such as Raman, FTIR, NMR, XRD, XRF, PIXE, etc. The aim of the project is to provide a semi-unified API to allow quick testing and comparing multiple baseline correction algorithms to find the best one for a set of data. pybaselines has 50+ baseline correction algorithms. These include popular algorithms, such as AsLS, airPLS, ModPoly, and SNIP, as well as many lesser-known algorithms. Most algorithms are adapted directly from literature, although there are a few that are unique to pybaselines, such as penalized spline versions of Whittaker-smoothing-based algorithms. The full list of implemented algorithms can be found in the documentation.
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB