Showing 79 open source projects for "images videos ai"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    ai-renamer

    ai-renamer

    A Node.js CLI that uses Ollama and LM Studio models

    ai-renamer is a Node.js-based command-line tool that uses large language models to automatically rename files based on their content, enabling more meaningful and organized file management. Instead of relying on manual naming or metadata, the tool analyzes the actual content of files, including images, videos, and documents, to generate descriptive and context-aware filenames.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Jaaz

    Jaaz

    Open source multimodal creative AI assistant with infinite canvas tool

    Jaaz is an open source multimodal creative assistant designed to help users generate and organize visual media using artificial intelligence. It functions as a creative workspace where images, videos, and visual storyboards can be produced and arranged on an infinite canvas environment. It combines AI agents with visual editing tools, allowing users to generate media through prompts, sketches, or simple instructions. Jaaz supports multiple AI models and can integrate both local and cloud-based inference systems, enabling flexible creative workflows. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Short Video Factory

    Short Video Factory

    AI tool for automatic batch short video creation and editing

    Short Video Factory is an open source desktop application designed to simplify the creation of short-form videos using AI-driven automation. It enables users to generate product marketing clips and general content videos by combining simple prompt-based input with pre-prepared media assets. Short Video Factory integrates multiple stages of video production, including script generation, voice synthesis, video editing, and subtitle effects, into a single streamlined workflow. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    OpenShorts

    OpenShorts

    Free & open source AI video platform

    ...It also supports generating marketing videos using AI actors, voiceovers, and scripted narratives without requiring cameras or production resources. The platform integrates publishing capabilities, allowing users to distribute content directly to TikTok, Instagram, and YouTube. Its architecture uses modern technologies such as FastAPI, FFmpeg, and AI models for transcription, analysis, and rendering.
    Downloads: 10 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 5
    Upscayl

    Upscayl

    Free and Open Source AI Image Upscaler for Linux, MacOS and Windows

    Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy. Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes. Upscayl does not work without a GPU, sorry. You'll need a Vulkan-compatible GPU to upscale images.
    Downloads: 158 This Week
    Last Update:
    See Project
  • 6
    Generative AI for beginners with JS

    Generative AI for beginners with JS

    Join a time-traveling adventure where you meet history’s legends

    ...Each lesson includes written explanations, hands-on exercises, quizzes, and supporting videos to help developers learn the material progressively. Topics covered include prompt engineering, building AI-powered applications, working with structured outputs, integrating retrieval-augmented generation, and enabling tool or function calling in AI systems. The repository focuses specifically on how generative AI can be integrated into web, mobile, or desktop applications using JavaScript frameworks and APIs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    MCP YouTube

    MCP YouTube

    A Model-Context Protocol Server for YouTube

    The YouTube MCP Server uses yt-dlp to download subtitles from YouTube videos and connects to claude.ai via the Model Context Protocol. It enables AI assistants to summarize YouTube videos by accessing their subtitles. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Diffusion Bee

    Diffusion Bee

    Diffusion Bee is the easiest way to run Stable Diffusion locally

    Diffusion Bee is a user-friendly local application designed to make running the Stable Diffusion text-to-image generative model as simple as possible on macOS machines, including both Intel and Apple Silicon. It wraps Stable Diffusion and its dependencies into a one-click installer so users don’t need to manually install Python, drivers, or machine-learning frameworks to generate images. The app runs entirely on the local machine so images are created offline and no user data is sent to...
    Downloads: 29 This Week
    Last Update:
    See Project
  • 9
    douyin

    douyin

    Open source Douyin crawler for collecting and downloading public data

    ...It demonstrates how to build a Python-based web crawler combined with a graphical interface and command line functionality. It allows users to collect data from various types of Douyin content, including user profiles, videos, hashtags, and music pages. DouyinCrawler supports both automated scraping and batch operations to process multiple targets efficiently. It also integrates with the Aria2 download utility to enable large-scale downloading of videos and images associated with collected content. It includes multiple usage modes such as a desktop GUI, a web service interface, and a command line tool for flexible deployment. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    TikTok MCP

    TikTok MCP

    Model Context Protocol (MCP) with TikTok integration

    The TikTok MCP integrates TikTok access into AI applications like Claude AI via TikNeuron. It enables analysis and interaction with TikTok content to determine virality factors and extract video content. ​
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    Easy Diffusion

    Easy Diffusion

    An easy 1-click way to create beautiful artwork on your PC using AI

    Easy Diffusion is a widely used community-driven repository offering a simple, one-click way to install and use Stable Diffusion-based generative AI on a personal computer without advanced technical skills or prior setup. It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation — tasks that can be daunting to beginners — and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. ...
    Downloads: 47 This Week
    Last Update:
    See Project
  • 12
    Xianyu Intelligent Monitor Bot

    Xianyu Intelligent Monitor Bot

    AI tool for real-time monitoring and analysis of Goofish listings

    ...A built-in web management interface allows users to create tasks, review results, and manage monitoring rules without relying solely on command line tools. AI models analyze product descriptions, images, and seller information to determine whether a listing meets defined requirements and should be recommended to the user.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    AUTOMATIC1111 Stable Diffusion web UI
    AUTOMATIC1111's stable-diffusion-webui is a powerful, user-friendly web interface built on the Gradio library that allows users to easily interact with Stable Diffusion models for AI-powered image generation. Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options. With a flexible installation process across Windows, Linux, and...
    Downloads: 267 This Week
    Last Update:
    See Project
  • 14
    MCP Server Amazon Bedrock

    MCP Server Amazon Bedrock

    Model Context Procotol(MCP) server for using Amazon Bedrock

    The Amazon Bedrock MCP Server is an MCP server that integrates with Amazon Bedrock's Nova Canvas model for AI image generation. It allows users to generate high-quality images from text descriptions using Amazon's AI capabilities. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    StoryGen Atelier

    StoryGen Atelier

    AI-assisted storyboard and video generation tool

    StoryGen Atelier is an advanced creative tool that blends AI with visual storytelling, making it possible to generate fully structured storyboards and stitched videos from text prompts without requiring manual art or animation skills. Users begin with natural language descriptions of their story or scene, and the system uses state-of-the-art large models to generate both the script and corresponding frames.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Editly

    Editly

    Slick, declarative command line video editing & API

    Editly is a tool and framework for declarative NLE (non-linear video editing) using Node.js and FFmpeg. Editly allows you to easily and programmatically create a video from a set of clips, images, audio and titles, with smooth transitions and music overlaid. Editly has a simple CLI for quickly assembling a video from a set of clips or images, or you can use its more flexible JavaScript API. Inspired by ffmpeg-concat, Editly is much faster and doesn't require much storage because it uses...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    DeepCamera

    DeepCamera

    Open-Source AI Camera. Empower any camera/CCTV

    ...It provides open-source facial recognition-based intrusion detection, fall detection, and parking lot monitoring with the inference engine on your local device. SharpAI-hub is the cloud hosting for AI applications that helps you deploy AI applications with your CCTV camera on your edge device in minutes. SharpAI yolov7_reid is an open-source Python application that leverages AI technologies to detect intruders with traditional surveillance cameras. The source code is here It leverages Yolov7 as a person detector, FastReID for person feature extraction, Milvus the local vector database for self-supervised learning to identify unseen persons, Labelstudio to host images locally and for further usage such as label data and train your own classifier. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Open WebUI

    Open WebUI

    User-friendly AI Interface

    Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. It supports various LLM runners like Ollama and OpenAI-compatible APIs, with a built-in inference engine for Retrieval Augmented Generation (RAG), making it a powerful AI deployment solution. Key features include effortless setup via Docker or Kubernetes, seamless integration with OpenAI-compatible APIs, granular permissions and user groups for enhanced security,...
    Downloads: 114 This Week
    Last Update:
    See Project
  • 19
    AIGCPanel

    AIGCPanel

    One-stop AI digital human system with video voice synthesis tools

    ...It supports one-click model deployment, making it accessible even to beginners who may not be familiar with complex AI environments. AIGCPanel also includes tools for synchronizing lip movements with generated speech, enabling more realistic digital avatar videos. Built using modern desktop technologies, it delivers a cross-platform experience while maintaining a graphical interface for monitoring tasks and logs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    canvas-constructor

    canvas-constructor

    An ES6 utility for canvas with built-in functions and chained methods

    An ES6 utility for canvas with built-in functions and chained methods. Alternatively, you can import canvas-constructor/browser. That will create a canvas with size of 300 pixels width, 300 pixels height. Set the color to #AEFD54. Draw a rectangle with the previous color, covering all the pixels from (5, 5) to (290 + 5, 290 + 5) Set the color to #FFAE23. Set the font size to 28 pixels with font Impact. Write the text 'Hello World!' in the position (130, 150) Return a buffer.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Markdownify MCP Server

    Markdownify MCP Server

    Convert files and web content into clean, usable Markdown easily

    ...It also allows retrieval of existing Markdown files, making it useful for documentation, research, and AI-assisted workflows. By standardizing content into Markdown, it helps unify inputs across different sources for better processing and integration with AI tools and developer environments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PyGPT

    PyGPT

    Open source personal AI Assistant for Linux, Windows and Mac

    PyGPT is a desktop application that allows you to talk to OpenAI's LLM models such as GPT4 and GPT3 using your own computer and OpenAI API. It allows you to talk in chat mode and in completion mode, as well as generate images using DALL-E 2. PyGPT also adds access to the Internet for GPT via Google Custom Search API and Wikipedia API and includes voice synthesis using Microsoft Azure Text-to-Speech API. Moreover, the application has implemented context memory support, context storage,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Talk to Figma MCP

    Talk to Figma MCP

    AI bridge enabling Cursor agents to read and modify Figma designs

    cursor-talk-to-figma-mcp is an open source integration that connects AI coding agents with Figma through the Model Context Protocol (MCP). It allows AI tools such as Cursor or other compatible agents to directly communicate with Figma documents and interact with design elements programmatically. Through this integration, an AI assistant can read the structure of a design, retrieve information about nodes or selections, and perform modifications to the layout or content....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    HyperTAG

    HyperTAG

    AI-Generated Tags and Summaries for Telegram Messages

    HyperTAG is a Telegram bot that leverages advanced AI models to generate context-aware tags and summaries for your messages. It can also automatically create tags and summaries for channel posts. The bot analyzes the text content of your messages along with any included links, producing relevant tags and summaries based on both the message text and the text from the links. Additionally, it generates summaries for Web pages or YouTube videos using their links.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    LandPPT

    LandPPT

    An LLM-based presentation generation platform

    ...The application integrates multiple AI models from providers such as OpenAI, Anthropic, Google, and locally hosted models to generate text, images, and structured presentation layouts. It also includes template systems and style options that allow presentations to be customized for different industries, visual themes, or storytelling formats.
    Downloads: 4 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • Next
MongoDB Logo MongoDB