Showing 110 open source projects for "media"

  • 1
    Telegram Media Downloader

    Download media files from a Telegram conversation/chat/channel

    Download media files from a Telegram conversation/chat/channel, up to 2GiB per file.
    Downloads: 39 This Week
  • 2
    Postiz

    The ultimate social media scheduling tool, with a bunch of AI

    ...Learn from your data and improve your social media strategy. Track your performance and optimize your content.
    Downloads: 11 This Week
  • 3
    LLPlayer

    The media player for language learning, with dual subtitles

    LLPlayer is an open-source media player designed specifically for language learning through video content. Unlike traditional media players, the application focuses on advanced subtitle-related features that help learners understand and interact with foreign language media more effectively. The player supports dual subtitles so users can simultaneously view text in both the original language and their native language while watching videos.
    Downloads: 56 This Week
  • 4
    MoneyPrinterTurbo

    Generate short videos with one click using AI LLM

    MoneyPrinterTurbo is an AI-driven tool that enables users to generate high-definition short videos with minimal input. By providing a topic or keyword, the system automatically creates video scripts, sources relevant media assets, adds subtitles, and incorporates background music, resulting in a polished video ready for distribution.
    Downloads: 17 This Week
  • 5
    Dolphin

    Document Image Parsing via Heterogeneous Anchor Prompting

    Dolphin, maintained by ByteDance, is a document image parsing model built around heterogeneous anchor prompting. It takes document images, a media format that can mix text, tables, formulas, and figures, and converts them into structured output. ...
    Downloads: 0 This Week
  • 6
    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    ...It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. It separates client-side media handling from backend AI processing, reducing data exposure while still enabling transcription and document generation. AI-Media2Doc supports flexible customization through prompts, allowing users to tailor output styles based on their needs. ...
    Downloads: 6 This Week
  • 7
    FaceFusion

    Industry leading face manipulation platform

    ...FaceFusion is built with a modular pipeline that allows users to customize processing steps and optimize performance for different hardware environments. The tool is often used in content creation, visual effects experimentation, and research into generative media. Overall, FaceFusion functions as a flexible and extensible platform for AI-driven face replacement and enhancement tasks.
    Downloads: 219 This Week
  • 8
    SuggestArr

    Request recommended movies, TV shows and anime to Jellyseerr/Overseerr

    SuggestArr is an open-source automation platform designed to recommend and automatically request movies, TV shows, and anime based on a user’s viewing history in self-hosted media servers. The project integrates with popular media management systems such as Jellyfin, Plex, and Emby, allowing it to analyze recently watched content and identify similar titles using metadata from the TMDb database. Once potential recommendations are identified, SuggestArr can automatically send download or request instructions to services like Jellyseerr or Overseerr, which then coordinate with media download tools and libraries. ...
    Downloads: 0 This Week
  • 9
    Telegram Search

    AI-powered Telegram chat backup and semantic search tool

    ...It also incorporates AI-driven capabilities such as contextual question answering and unread message summarization, allowing users to interact with their message history in a more natural way. Additionally, it provides media handling features, including backup and semantic image search, enhancing discoverability.
    Downloads: 17 This Week
  • 10
    MoneyPrinter V2

    Automate the process of making money online

    MoneyPrinter V2 is an open-source automation platform designed to streamline and scale online income generation workflows by combining content creation, social media automation, and marketing strategies into a single system. It is a complete rewrite of the original MoneyPrinter project, focusing on modularity, extensibility, and broader functionality across multiple monetization channels. The platform operates primarily through Python-based scripts that automate tasks such as generating and publishing YouTube Shorts, posting on social media platforms like Twitter, and executing affiliate marketing campaigns. ...
    Downloads: 11 This Week
  • 11
    Sora.FM

    Sora AI Video Generator by Sora.FM

    ...The project belongs to the growing class of AI video generators and AI-assisted content creation tools: it may use model-based generation, template-based editing, or a combination of video assets and generative models to automate parts of video creation or editing. For creators who want to explore AI-based content generation, such as automated video clips or short-form media, sorafm offers a starting point. As with many open-source generators in this space, the tradeoff is between ease of use and the limitations of generative output, but because the code is publicly available, users can experiment, iterate, or fork it to adapt pipelines, for example by customizing model prompts, video templates, or post-processing.
    Downloads: 5 This Week
  • 12
    Medeo Video Generator

    AI-powered video generation skill for OpenClaw

    ...The project focuses on bridging the gap between language-based AI systems and multimedia outputs by enabling models to produce structured video content as part of their workflows. It supports tasks such as video generation, editing, and transformation, making it useful for applications in content creation, marketing, and automated media production. The framework is designed to be modular, allowing developers to plug video capabilities into larger AI pipelines or agent systems. It emphasizes ease of integration and scalability, enabling both simple use cases and more complex multimedia workflows.
    Downloads: 12 This Week
  • 13
    wa-automate-nodejs

    WhatsApp tool for chatbots with advanced features

    wa-automate-nodejs is the most advanced NodeJS library which provides a high-level API to control WA. Want to convert your WA account to an API instantly? You can now with the CLI. For more details see Easy API. After executing create() function, @open-wa/wa-automate will create an instance of WA web. If you are not logged in, it will print a QR code in the terminal. Scan it with your phone and you are ready to go! @open-wa/wa-automate will remember the session so there is no need to...
    Downloads: 1 This Week
  • 14
    ShortGPT

    AI framework for automated short video creation and editing tools

    ...ShortGPT includes specialized content engines that manage different workflows, such as generating short videos, producing longer videos, and translating existing videos into other languages. It can automatically assemble videos by combining generated scripts, sourced media assets, captions, and synthesized voice narration. A modular editing system based on structured markup and JSON allows editing steps to be broken into manageable components that can be interpreted by language models.
    Downloads: 10 This Week
  • 15
    Groq Desktop

    Local Groq Desktop chat app with MCP support

    ...The app is built with modern web technologies and packaged for native distribution, making it accessible even to non-developer users who just want to experiment with Groq models without writing code. Developers can also use groq-desktop-beta as a lightweight interface to test prompts, media inputs, or function-calling capabilities before embedding them into larger projects. The project offers installable builds (including via Homebrew on macOS) and supports easy setup, giving quick access to Groq’s inference services without needing to spin up a full backend.
    Downloads: 14 This Week
  • 16
    Project NOMAD

    Self-contained, offline survival computer with tools, knowledge, & AI

    Project N.O.M.A.D. (Node for Offline Media, Archives, and Data) is a self-contained, offline-first knowledge and education server designed to run on local hardware. It combines AI, educational resources, and critical information tools into a single platform accessible through a web browser. Built as a containerized system managed via Docker, it orchestrates multiple applications through a centralized “Command Center” interface.
    Downloads: 64 This Week
  • 17
    RisuAI

    Make your own story. User-friendly software for LLM roleplaying

    RisuAI (or Risu) is a cross-platform AI roleplay chat application, available as both a desktop and web solution, offering creative story-building and character interaction with support for multiple APIs, in-chat assets, regex capabilities, and more. It supports OpenAI, Claude, Gemini, DeepInfra, Ooba, OpenRouter, and more. It can display an image of the current character that matches the character's expression, and it can modify the model's output by regex, for example to build a custom GUI.
    Downloads: 28 This Week
  • 18
    Scene Framework

    Android Single Activity Framework compatible with Fragment

    Scene is a ByteDance-hosted Android library: a single-Activity framework that is compatible with Fragment. It structures an application as "scenes" and helps developers manage navigation, transitions, and UI composition, with lifecycle handling for each scene. As such, Scene can serve as a backbone for media apps or other modular user interface projects. ...
    Downloads: 0 This Week
  • 19
    Vibe

    Transcribe on your own

    ...Vibe transcribes audio and video media locally, on the user's own machine, so recordings do not have to be sent to an external service. Because the project is open source, developers can inspect the codebase, customize existing components, and contribute improvements.
    Downloads: 25 This Week
  • 20
    WhatsApp MCP Server

    WhatsApp MCP server enabling AI access to chats and messaging

    ...All message data is stored in a local SQLite database and is only accessed when explicitly requested through defined tools, giving users control over how their data is used. It supports both sending and receiving messages, including various media types such as images, audio, videos, and documents. It integrates with AI applications like Claude through MCP, enabling conversational automation and contextual message retrieval.
    Downloads: 2 This Week
  • 21
    BettaFish

    Public opinion analysis system

    BettaFish is an open-source, multi-agent public opinion analysis system built to automate the collection, deep analysis, and reporting of social media data at scale through conversational queries. It uses a modular architecture of specialized agents that collaborate to crawl mainstream platforms, extract multimodal content like text and short video, and synthesize insights through both statistical and large language model techniques. With a design that lets users pose questions in natural language and receive structured reports, charts, and visualizations, the system aims to break information cocoons and provide comprehensive views of trends and public sentiment. ...
    Downloads: 1 This Week
  • 22
    Robyn

    Experimental, AI/ML-powered and open sourced Marketing Mix Modeling

    ...Its goal is to democratize rigorous MMM: what traditionally required expert statisticians and expensive consulting becomes accessible to any company with data. Robyn takes in historical data (spend on different marketing channels, conversions or revenue, and optional context or organic-media variables) and combines regularized regression (Ridge), time-series decomposition (trend, seasonality, holiday effects), and hyperparameter optimization (via evolutionary algorithms) to estimate the incremental impact of each marketing channel. It explicitly models “carry-over” (adstock) and diminishing-returns (saturation) effects per channel, enabling realistic modeling of how advertising persists over time and saturates.
    Downloads: 5 This Week
  • 23
    Groq TypeScript / Node.js

    The official Node.js / TypeScript library for the Groq API

    ...It exports strongly-typed interfaces for models, chat completions, file uploads (e.g. for audio transcription), and other endpoints, allowing for better type safety and developer experience when using Groq from TypeScript. The library also supports passing different input types (file streams, blobs, fetch responses) for media-related endpoints, making it flexible for diverse environments (backend, browser, serverless). With this SDK, developers can call Groq’s models, transcribe audio, perform file uploads — all with minimal boilerplate — which streamlines creation of AI-enabled applications in the JavaScript/TypeScript ecosystem.
    Downloads: 3 This Week
  • 24
    AudioMuse-AI

    AudioMuse-AI is an Open Source Dockerized environment

    ...By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal music libraries and generate curated playlists for different moods or listening contexts. AudioMuse-AI integrates with several popular self-hosted music servers including Jellyfin, Navidrome, and Emby, allowing users to extend existing media servers with advanced AI-powered recommendation capabilities. The system uses machine learning and audio analysis tools such as Librosa and ONNX models to extract features directly from audio tracks.
    Downloads: 2 This Week
  • 25
    LTX-2

    Python inference and LoRA trainer package for the LTX-2 audio–video

    ...The repository provides a Python package for running inference with the LTX-2 audio-video model, together with a LoRA trainer for fine-tuning it, making it a starting point for generating media or adapting the model to custom data.
    Downloads: 41 This Week