Showing 381 open source projects for "media"

  • 1
    YandexStation

    Management of Yandex Station and other smart home devices

    YandexStation is a Home Assistant custom component that integrates Yandex-branded smart speakers and other devices with Alice into a unified smart home automation environment. It supports both local and cloud control, depending on the device type, with Yandex speakers often supporting both modes and third-party speakers typically limited to cloud control. The integration exposes playback and volume controls, as well as text-to-speech capabilities that send spoken messages in Alice’s voice...
    Downloads: 5 This Week
  • 2
    Qwen-2.5-VL

    Qwen2.5-VL is the multimodal large language model series

    Qwen2.5 is a series of large language models developed by the Qwen team at Alibaba Cloud, designed to enhance natural language understanding and generation across multiple languages. The models are available in various sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B parameters, catering to diverse computational requirements. Trained on a comprehensive dataset of up to 18 trillion tokens, Qwen2.5 models exhibit significant improvements in instruction following, long-text generation...
    Downloads: 11 This Week
  • 3
    Speakr

    Speakr is a personal, self-hosted web application

    ...The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. It also supports saving generated audio as downloadable files so users can reuse the speech outputs in other projects, presentations, or media content.
    Downloads: 2 This Week
  • 4
    Cookiecutter Django

    Framework for jumpstarting production-ready Django projects quickly

    ...Provides an optional basic ASGI setup for WebSockets and an optional custom static build using Gulp and livereload. Sends emails via Anymail (using Mailgun by default, or Amazon SES if AWS is the selected cloud provider, but switchable). Media storage using Amazon S3 or Google Cloud Storage. Docker support using docker-compose for development and production (using Traefik with LetsEncrypt support). Procfile for deploying to Heroku. Provides instructions for deploying to PythonAnywhere. You can run tests with unittest or pytest.
    Downloads: 3 This Week
  • 5
    SortPhotos

    SortPhotos is a Python script that organizes photos and videos

    ...With support for automation through launch agents or cron jobs, SortPhotos is well-suited for photographers, archivists, and anyone looking to streamline large personal or professional media collections.
    Downloads: 3 This Week
  • 6
    OpenHome Abilities

    Open-source abilities for OpenHome agents

    ...Each ability is intentionally simple in structure, centering on a single main.py file that contains the core Python logic, which lowers the barrier to building and sharing custom behaviors. The system is meant to support a wide range of voice-driven actions, from API calls and media playback to quiz flows, device control, and multi-turn conversations, so it functions as a practical extension framework rather than a narrow template library. The repository includes official abilities maintained by the OpenHome team as well as community-contributed ones, creating both a stable baseline and a path for outside developers to publish their own work.
    Downloads: 1 This Week
  • 7
    TorchAudio

    Data manipulation and transformation for audio signal processing

    The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...
    Downloads: 0 This Week
  • 8
    Bear Stone Smart Home

    Custom Home Assistant configuration with automations and scripts setup

    ...It defines how various smart home devices, services, and integrations are organized and controlled within a single environment. It includes configuration files that manage entities such as lights, sensors, switches, and media devices, enabling centralized automation and monitoring. It demonstrates how to structure Home Assistant YAML files for scalability and maintainability in a real-world deployment. Bear Stone Smart Home also showcases custom automations and scripts designed to improve convenience, energy efficiency, and overall smart home behavior. ...
    Downloads: 1 This Week
  • 9
    spider_collection

    Collection of Python web scraping scripts for data extraction tasks

    spider_collection is a collection of Python web crawler scripts created primarily for experimentation, learning, and practical scraping tasks. spider_collection gathers multiple independent spiders designed to collect data from different platforms and services, demonstrating a variety of scraping techniques and workflows. These crawlers make use of common Python scraping tools such as requests, parsel, BeautifulSoup, and the Scrapy framework to extract structured information from web pages....
    Downloads: 1 This Week
  • 10
    GLM-4.5V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...
    Downloads: 3 This Week
  • 11
    ex-skill

    Distill your ex into an AI Skill

    ex-skill is an experimental AI tooling project that allows users to transform personal memories, particularly past relationships, into interactive AI “skills” that replicate the communication style, personality, and behavioral patterns of a specific individual. The system works by ingesting various forms of personal data such as chat logs, social media content, photos, and user-provided descriptions, then structuring this information into a layered representation that combines memory and persona modeling. It is designed to run within Claude Code environments, where users can generate, manage, and interact with these personalized AI entities through command-based interfaces. The project emphasizes emotional realism by reconstructing conversational tone, habits, and contextual memories, enabling interactions that feel consistent with the original person.
    Downloads: 0 This Week
  • 12
    Reader 3

    Quick illustration of how one can easily read books together with LLMs

    ...The interface focuses on clarity and ease of use, offering straightforward navigation of book chapters rather than full-featured e-reading capabilities. While it lacks advanced features like built-in annotations or rich media support, its simplicity is intentional, enabling users to quickly load EPUBs, view them in a browser, and even repurpose text for downstream tasks.
    Downloads: 0 This Week
  • 13
    NExT-GPT

    Code and models for ICML 2024 paper, NExT-GPT

    ...The system connects a large language model with multimodal encoders and diffusion-based decoders so it can interpret information from different sensory formats and generate responses in different media types. This architecture allows the model to convert between modalities, such as generating images from text descriptions or producing audio or video outputs based on textual prompts. The project also introduces instruction-tuning strategies that enable the model to perform complex multimodal reasoning and generation tasks with minimal additional parameters.
    Downloads: 0 This Week
  • 14
    BlogWizard

    Generate blog articles from video or audio

    BlogWizard is a demo/utility project built on top of Groq’s LLM infrastructure that converts video or audio content into well-structured blog posts, enabling creators to repurpose multimedia content into text — useful for SEO, accessibility, or reaching audiences that prefer reading. The tool uses transcription (e.g. via Whisper) to extract text from audio/video, then runs an LLM-based generation pipeline to transform that content into coherent, readable blog-format posts — with sections,...
    Downloads: 0 This Week
  • 15
    shot-scraper

    A command-line utility for taking automated screenshots of websites

    ...The project is deeply integrated with automation workflows: examples show it running in scheduled jobs, GitHub Actions, and bots that publish screenshots to social media or use them in docs. It ships with detailed documentation, a tutorial, and a template repository that lets you spin up an automated screenshot pipeline.
    Downloads: 0 This Week
  • 16
    Mountain Island Media Center

    Create & display worship media for display on large TVs or projectors

    Create and display presentation files containing hymns from 2 included databases of public domain hymns. Create and display presentation files containing Bible passages. Organize worship flow by sequencing multiple presentation files. Windows users should review the Files/README.txt file prior to downloading the installer.
    Downloads: 5 This Week
  • 17
    Links Into Social Media Posts

    Provide a list of links, get back a CSV of social media post drafts

    Instantly mass-produce many social media posts using just your links! Turn your big list of website links into ready-to-use social media post drafts. This program automatically web-scrapes each link and generates a suitable title and 5 hashtags. Here's a sample of results:

    title,url,hashtags
    Skelegant - itch.io,https://skelegant.itch.io,#skelegant #itch #social #media #share
    APHRODITE by Skelegant: A cyberpunk reskin for vanilla Doom,https://skelegant.itch.io/aphrodite,#aphrodite #skelegant #cyberpunk #reskin #vanilla
    Black Magwell by Skelegant: Dehacked-powered gunplay,https://skelegant.itch.io/black-magwell,#black #magwell #skelegant #dehacked-powered #gunplay
    "Eraser Weapons by Skelegant: The void beckons, go kick its ass",https://skelegant.itch.io/eraser-weapons,#eraser #weapons #skelegant #void #beckons

    All I gave it were the original links. ...
    Downloads: 0 This Week
  • 18
    xSTUDIO

    xSTUDIO is a high performance playback and review tool.

    xSTUDIO is a high-performance playback and review tool designed by and for Visual Effects, Animation and Post Production professionals. The application can load and play large collections of media files. The efficient playback engine lets you quickly load and play high-resolution images across a wide range of file formats and encodings. Intuitive tools let you create and organise playlists, and media subsets within playlists, to build interactive review sessions and image and video reference libraries. A multi-track timeline editing interface lets you load or create edits, from simple to complex.
    Downloads: 32 This Week
  • 19
    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console, IDJC has two main media players, a background track player, effects buttons, a crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, and voice and VoIP integration. Supported media file formats include mp3, ogg, flac, wma, wav, and m4a, along with m3u, xspf, and pls playlists and cue sheets. IDJC also provides IRC track and station announcements and uses the JACK Audio Connection Kit to provide a flexible audio chain.
    Downloads: 12 This Week
  • 20
    Syncplay

    Synchronize your playback over the Internet

    Syncplay synchronises the position and play state of multiple media players so that the viewers can watch the same thing at the same time. This means that when one person pauses/unpauses playback or seeks (jumps position) within their media player then this will be replicated across all media players connected to the same server and in the same 'room' (viewing session). When a new person joins they will also be synchronised.
    Downloads: 7 This Week
  • 21
    MagicBox Player
    ...IPTV/Streaming Ready: Easily load and manage M3U/M3U8 playlists for streaming live TV channels or individual online media streams. Compact Mini Mode: Switch to a Mini Player for a seamless, space-saving playback experience while you multitask. Metadata Integration: Automatically fetches and displays song/video metadata (Title, Artist, Album, Copyright) and includes quick tools to search for media on external platforms like YouTube.
    Downloads: 3 This Week
  • 22
    MDAC (Media Downloader and Converter)

    Batch download multiple videos and convert them into a smaller size.

    Downloads videos with yt-dlp, then converts them with ffmpeg. You can also download videos in batch without converting them, or select a folder of videos to convert.
    Downloads: 0 This Week
  • 23
    AMV-Converter

    Converts Videos to .AMV (Used For Cheap MP3 Players)

    AMV Converter is a lightweight tool designed to convert videos to the AMV (Actions Media Video) format — a proprietary format used primarily in older MP4/MP3 players. Built using ffmpeg, this converter is optimized to produce AMV files with smaller file sizes, perfect for low-storage portable devices. The tool also includes a built-in video downloader powered by yt-dlp, with support for a wide range of video formats and subtitle/audio preservation options.
    Downloads: 153 This Week
  • 24
    whatsapp-api-client-python
    This library helps you easily create a Python application with the WhatsApp API. https://green-api.com/en/
    Downloads: 5 This Week
  • 25
    Tartube

    Download videos/channels/playlists from YouTube and many other sites

    Tartube is a GUI front-end for youtube-dl, yt-dlp and other compatible video downloaders. It is written in Python 3 / Gtk 3 and runs on MS Windows, Linux, BSD and macOS.
    Downloads: 999 This Week