Open Source Python Multimedia Software - Page 5

Python Multimedia Software

View 4896 business solutions

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    Kokua viewer

    Kokua viewer

    Kokua is a V3 based open source metaverse viewer project.

    Kokua is an open source metaverse viewer project based on the Second Life Viewer source. Disclaimer: This software is not provided or supported by Linden Lab, the makers of Second Life. The goal of Kokua is to greatly improve the usability of the Viewer through community involvement, thoughtful design, modern development methods, and a pro-change atmosphere.
    Leader badge
    Downloads: 196 This Week
    Last Update:
    See Project
  • 2
    This program, called "rebuild_db", allows the iPod shuffle to be used without iTunes. It rebuilds the iPod shuffle's databases based on the actual content of the iPod's file system, without restrictions regarding the directory structure.
    Downloads: 47 This Week
    Last Update:
    See Project
  • 3
    Clarity AI Upscaler

    Clarity AI Upscaler

    AI Image Upscaler & Enhancer

    Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. The project is intended as a free and open alternative to commercial AI upscaling tools, allowing developers and digital artists to run the technology locally or integrate it into their own workflows. The repository includes a full application environment with scripts, configuration files, and model support that allow users to run the upscaler as a standalone tool or integrate it into other pipelines.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    GIMP ML

    GIMP ML

    AI for GNU Image Manipulation Program

    This repository introduces GIMP3-ML, a set of Python plugins for the widely popular GNU Image Manipulation Program (GIMP). It enables the use of recent advances in computer vision to the conventional image editing pipeline. Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins. Additionally, operations on images such as edge detection and color clustering have also been added. GIMP-ML relies on standard Python packages such as numpy, scikit-image, pillow, pytorch, open-cv, scipy. In addition, GIMP-ML also aims to bring the benefits of using deep learning networks used for computer vision tasks to routine image processing workflows.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Monkey-DL

    Monkey-DL

    Bulk download your favourite anime episodes from your favourite anime

    Monkey-DL is a command-line media downloader designed to retrieve video and audio content from online platforms with flexibility and automation. It integrates with tools like FFmpeg to handle post-processing tasks such as merging streams, converting formats, and optimizing output quality. The tool supports downloading single media files or entire playlists, enabling efficient batch operations. It includes options for selecting resolution, format, and output structure, giving users fine control over downloads. monkey-dl is built for simplicity, providing straightforward commands while still supporting advanced configurations. Its lightweight design makes it suitable for scripting and integration into automation workflows. Overall, it serves as a practical solution for downloading and organizing online media content.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    Importer library to import assets from different common 3D file formats such as Collada, Blend, Obj, X, 3DS, LWO, MD5, MD2, MD3, MDL, MS3D and a lot of other formats. The data is stored in an own in-memory data-format, which can be easily processed. www.open3mod.com/ is a 3D model viewer and exporter based on Assimp that is also Open Source.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 7
    AV1 AVIF

    AV1 AVIF

    AV1 Image File Format Specification - ISO-BMFF/HEIF derivative

    AV1 AVIF is the official specification and reference design for the AV1 Image File Format (AVIF), defining how AV1-encoded bitstreams are packaged into the HEIF container format (based on ISOBMFF) to produce AVIF files. The project outlines the syntax and semantics required for AVIF compliance, including support for multiple image profiles, color depths, chroma subsampling modes, HDR/WCG, alpha channels, animation/image sequences, and various color-space/bit-depth combinations — making AVIF a versatile, modern image format suitable for both simple photos and advanced imagery needing high fidelity. The specification ensures interoperability across encoders and decoders, providing guidelines so that images created by any compliant AVIF encoder can be reliably decoded by compliant decoders. As adoption grows, AV1 AVIF plays a crucial role in promoting a royalty-free, open, high-efficiency image standard that competes with older formats such as JPEG and newer proprietary ones.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    Kornia

    Kornia

    Open Source Differentiable Computer Vision Library

    Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend both for efficiency and to take advantage of the reverse-mode auto-differentiation to define and compute the gradient of complex functions. Inspired by existing packages, this library is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors. With Kornia we fill the gap between classical and deep computer vision that implements standard and advanced vision algorithms for AI. Our libraries and initiatives are always according to the community needs.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Mirrorcast

    Mirrorcast

    Open Source Alternative to Chromecast, Mirror Desktop and Play media r

    The idea is to replicate what Chromecast can do in regards to screen mirroring and streaming media to a remote display. Google chromes screen mirroring feature works well when used with a receiver such as Chromecast but this is a proprietary solution and audio does not work for desktop mirroring on some operating systems. At the moment, there is only a client for Debian/Ubuntu Operating systems and a server/receiver application for Raspberry pi. Mirrorcast aims to be a low latency screen mirroring solution with high-quality video and audio at 25-30fps, the later is why we will not use something like VNC. Mirrorcast uses up about the same amount of system resources as google chromes cast feature. The delay is less than 1 second on most networks. To achieve this we will use existing FOSS software such as ffmpeg, mpv, and omxplayer.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    Render Farm Manager, Project Tracker.

    Render Farm Manager, Project Tracker.

    CGRU: Afanasy render farm manager and RULES project tracker.

    CGRU is an open source CG tools pack, includes Afanasy render farm manager and RULES project tracker.
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 11
    PyTube Downloader

    PyTube Downloader

    Let's quickly download YouTube videos & playlists with one click.

    PyTube Downloader lets you quickly download YouTube videos and playlists with one click. Choose from 144p to 8K quality and download multiple videos simultaneously. PyTube Downloader 让你轻松一键下载 YouTube 视频和播放列表。支持选择 144p 到 8K 的质量,并支持同时下载多个视频。
    Downloads: 81 This Week
    Last Update:
    See Project
  • 12
    PhotoFilmStrip
    PhotoFilmStrip creates movies out of your pictures in just 3 steps. First select your photos, customize the motion path and render the video. There are several output possibilities for VCD, SVCD, DVD up to FULL-HD. Creates animated slideshows.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 13
    Kirstens Viewers

    Kirstens Viewers

    Opensource Created Custom Viewers For Virtual Worlds like SecondLife.

    Kirsten’s Viewer is a fast, modern Third‑Party Viewer (TPV) for Second Life, registered under the official TPV directory. It’s built for creators, photographers, and advanced users who want a clean, modern viewer with a focus on high performance on high end PC's Anaglyph 3D Mode , OpenCL‑based Visual Effects, Aggressive Optimisation, vcpkg + PowerShell One‑Click Build Automation, Highly Tuned Graphics Path , Many Other Cutting‑Edge Features — ongoing experimental work, performance improvements, and creator‑driven enhancements continue to push the viewer forward.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 14
    AutoSub

    AutoSub

    A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

    AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. AutoSub leverages FFmpeg for media handling and integrates with speech recognition engines for transcription. It is particularly useful for content creators who want to quickly produce subtitles without manual effort. Overall, it simplifies the process of making media content accessible and searchable.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15
    DeepSpeech

    DeepSpeech

    Open source embedded speech-to-text engine

    DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    Image-Editor

    Image-Editor

    AI based photo editing website for changing image background

    Welcome to Image-Editor, the AI-based photo editing website that lets you change backgrounds, colors, crop, sharpen images, and much more with just a single click. With exceptional image quality and fast processing times, Image-Editor is the ultimate tool for all your photo editing needs. To get started, simply run pip install -r requirements.txt to download all the necessary libraries. Then to, create a new Django project using django-admin startproject Website1, replacing 'Website1' with the name of your choice. Image-Editor uses Python's cv2 library, which provides an easy and efficient way to work with images and videos, including a wide range of image processing and computer vision algorithms. With cv2, you can easily read, write, filter, and display images, and much more. Image-Editor uses Mediapipe's selfie_segmentation model for background removal in real-time video streams. This advanced model uses deep neural networks to detect and remove the background.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    LingBot-Map

    LingBot-Map

    A feed-forward 3D foundation model for reconstructing scenes

    LingBot-Map is a specialized project focused on mapping conversational or linguistic interactions within chatbot or AI-driven systems, providing a structured way to visualize and organize dialogue flows. It is designed to help developers understand how conversations evolve across different states, enabling better debugging and optimization of chatbot behavior. The system emphasizes mapping relationships between intents, responses, and transitions, creating a clear representation of conversational logic. It can be particularly useful in complex chatbot systems where multiple branches and conditions need to be managed effectively. The project supports extensibility, allowing developers to adapt the mapping system to different chatbot frameworks or AI models. Its design encourages clarity and transparency in conversational design, reducing ambiguity in dialogue flows. Overall, lingbot-map serves as a tool for improving the structure and reliability of conversational AI systems.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including Soulseek, YouTube, and Beatport, then verify file accuracy through AcoustID fingerprinting to ensure the correct audio was obtained. The platform also enriches tracks with lyrics, album art, and MusicBrainz metadata while organizing files into clean, customizable folder structures. Once processed, the library can be synchronized with media servers like Plex, Jellyfin, or Navidrome so collections stay up to date without manual intervention.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    You-Get

    You-Get

    Dumb downloader that scrapes the web

    You-Get is a small command-line utility for downloading media (video, audio and images) from the Web when there are no other means to do so. It can download video and audio files from such popular web sites as YouTube, Twitter, Niconico, Vimeo, Flickr, Instagram and a whole lot more. You-Get is a great option for when you want to enjoy your favorite videos, audio or images from the internet without having to open any web browsers or get interrupted by ads. It’s also a good choice for when you want to get rid of any closed-source technology or proprietary JavaScript code from running on your computer.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 20
    FF Multi Converter
    FF Multi Converter is a simple graphical application which enables you to convert audio, video, image and document files between all popular formats, using and combining other programs. It uses ffmpeg for audio/video files, unoconv for document files and ImageMagick for image file conversions.
    Downloads: 39 This Week
    Last Update:
    See Project
  • 21
    Impressive
    Impressive is a program that displays PDF presentation slides with style. Smooth alpha-blended slide transitions are provided for the sake of eye candy, but in addition to this, Impressive offers some unique tools that are very useful for presentations.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 22
    Natron

    Natron

    Open source, cross-platform compositing software

    Natron is an open-source, cross-platform nodal compositing software.
    Leader badge
    Downloads: 118 This Week
    Last Update:
    See Project
  • 23
    PyKaraoke is a cross-platform karaoke player. It currently supports CDG (MP3+G, OGG+G, WAV+G), MIDI (.KAR, .MID) and MPEG formats.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 24
    FontForge Windows builds

    FontForge Windows builds

    Unofficial Windows builds of FontForge

    The aim of this project is to compile up-to-date Windows builds of FontForge. For 'stable' builds, see https://github.com/fontforge/fontforge/releases The build system used was based off that offered by Matthew Petroff (http://www.mpetroff.net/software/fontforge-windows/), but has since been practically rewritten. New in 11/07/2020: * Synced with the 20201107 release. New in 06/04/2020: * Updated to latest master, picks up a clipboard copying fix New in 14/03/2020: * Synced with the 20200314 release. New in 01/03/2020: * Updated to latest master, now built with CMake. (prerelease) New in 02/06/2019: * The 32-bit build now uses Python 3 (3.7) instead of Python 2. No further Python 2 builds will be provided. * The GDK3 backend is now used. VcXsrv is no longer bundled. New in 31/07/2017: KNOWN ISSUES: * CTRL-C from console no longer interrupts/stops FontForge
    Leader badge
    Downloads: 35 This Week
    Last Update:
    See Project
  • 25
    OpenPiano — Virtual Piano for Windows

    OpenPiano — Virtual Piano for Windows

    Desktop piano playable with a PC keyboard, mouse, or MIDI device.

    OpenPiano is a Windows desktop piano application that allows you to play, practice, and record music using your PC keyboard, mouse, or a MIDI device. It supports real-time playback using SoundFonts and provides on-screen piano layouts for visual feedback while playing. OpenPiano is designed to run entirely locally. It does not require accounts, cloud services, or an internet connection for core functionality. Project links: Website: https://www.justagwas.com/projects/openpiano GitHub: https://github.com/Justagwas/openpiano Documentation: https://github.com/Justagwas/openpiano/wiki The application is fully open source. All input handling, playback, and MIDI recording/export are performed on the user's machine. VirusTotal scan result: https://www.virustotal.com/gui/file/6b55150577f852dbb4909c9590c902416db5e552ef849f2f44cd6f34251cfbd9
    Leader badge
    Downloads: 115 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB