Search Results for "generate image" - Page 2

Showing 334 open source projects for "generate image"

View related business solutions
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 1
    Guizang Social Card Skill

    Guizang Social Card Skill

    Claude Code / Codex skill — generate Xiaohongshu carousels

    Guizang Social Card Skill is an AI-agent skill for generating polished social image packages in a Guizang-inspired visual style. It is designed for formats such as Xiaohongshu or Rednote carousels, WeChat Official Account covers, article covers, product update graphics, thumbnails, and screenshot-heavy posts. The skill turns articles, scripts, screenshots, product notes, subtitles, or photos into structured social card outputs. It supports editorial magazine layouts and Swiss-style visual...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 2
    Aidea

    Aidea

    Flutter-based cross-platform app integrating major AI models

    AIdea is a comprehensive Flutter-based cross-platform app integrating major AI models—OpenAI GPT, Chinese models Tongyi Qianwen and Wenxin Yiyan, plus image models like Stable Diffusion for text-to-image, image-to-image, SDXL 1.0, super-resolution, and colorization. It includes a client app, server backend, and Docker deployment scripts for hosted setups.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Agregore Browser

    Agregore Browser

    A minimal browser for the distributed web (Desktop version)

    A minimal web browser for the distributed web. Web Extension support. Built-in Markdown/Gemini/JSON rendering extension. Built-in QR code scanner and generator extension. Generate a QR code for the current page. Scan a QR code from the browser action window. Right-click a link or image to generate a QR code for it. Built-in ad blocker (ublock origin). Built-in support for creating web archives via ArchiveWeb.page.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    HunyuanCustom

    HunyuanCustom

    Multimodal-Driven Architecture for Customized Video Generation

    HunyuanCustom is a multimodal video customization framework by Tencent Hunyuan, aimed at generating customized videos featuring particular subjects (people, characters) under flexible conditions, while maintaining subject/identity consistency. It supports conditioning via image, audio, video, and text, and can perform subject replacement in videos, generate avatars speaking given audio, or combine multiple subject images. The architecture builds on HunyuanVideo, with added modules for identity reinforcement and modality-specific condition injection. Text-image fusion module based on LLaVA for improved multimodal understanding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 5
    InfiniteYou

    InfiniteYou

    Flexible Photo Recrafting While Preserving Your Identity

    InfiniteYou is an open-source image-generation and “identity-preserving image editing / generation” framework from ByteDance, designed to generate high-fidelity images that preserve a subject’s identity while allowing flexible editing or re-creation according to textual prompts. Using an architecture built around diffusion transformers (DiTs), InfiniteYou introduces a component called InfuseNet that injects identity features derived from reference images into the generation process — via residual connections — so that the output matches the person’s identity closely, without sacrificing visual quality or text-image alignment. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    OpenAI DALL·E AsyncImage SwiftUI

    OpenAI DALL·E AsyncImage SwiftUI

    OpenAI swift async text to image for SwiftUI app using OpenAI

    SwiftUI views that asynchronously loads and displays an OpenAI image from open API. You just type in your idea and AI will give you an art solution. DALL-E and DALL-E 2 are deep learning models developed by OpenAI to generate digital images from natural language descriptions, called "prompts". You need to have Xcode 13 installed in order to have access to Documentation Compiler (DocC) OpenAI's text-to-image model DALL-E 2 is a recent example of diffusion models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Story Flicks

    Story Flicks

    Generate high-definition story short videos with one click using AI

    ...Because the project is open and modifiable, developers can customize the generation pipeline: adjust story structure, alter rendering parameters, tweak video quality or resolution, or integrate with other AI models (e.g. for audio, voice-over, or image-to-video). It’s especially useful as a starting template or experimentation ground for developers building automated content-creation tools.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    MCP Server Amazon Bedrock

    MCP Server Amazon Bedrock

    Model Context Procotol(MCP) server for using Amazon Bedrock

    The Amazon Bedrock MCP Server is an MCP server that integrates with Amazon Bedrock's Nova Canvas model for AI image generation. It allows users to generate high-quality images from text descriptions using Amazon's AI capabilities. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Mini QR

    Mini QR

    Create & scan cute qr codes easily

    Mini QR is a web app focused on making QR codes feel friendly and design-forward, combining a polished QR generator with a built-in scanner so you can both create and decode codes in the same place. It emphasizes customization so the QR you generate can match a brand, event theme, or personal style, including color and styling controls, framed layouts with labels, and the ability to add a logo image. Because QR reliability matters as much as looks, it exposes practical settings like error correction levels so you can balance data density with scannability, especially when adding a logo or encoding larger payloads. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 10
    Hunyuan3D 2.0

    Hunyuan3D 2.0

    High-Resolution 3D Assets Generation with Large Scale Diffusion Models

    The Hunyuan3D-2 model, developed by Tencent, is designed for generating high-resolution 3D assets using large-scale diffusion models. This model offers advanced capabilities for creating detailed 3D models, including texture enhancements, multi-view shape generation, and rapid inference for real-time applications. It is particularly useful for industries requiring high-quality 3D content, such as gaming, film, and virtual reality. Hunyuan3D-2 supports various enhancements and is available...
    Downloads: 33 This Week
    Last Update:
    See Project
  • 11
    Docker Gradle Plugin

    Docker Gradle Plugin

    Gradle plugin for orchestrating docker builds and pushes

    a Gradle plugin for orchestrating docker builds and pushes. Add basic tasks for building and pushing docker images based on a simple configuration block that specifies the container name, the Dockerfile, task dependencies, and any additional file resources required for the Docker build. Adds a task for populating placeholders in a docker-compose template file with image versions resolved from dependencies. Adds tasks for starting, stopping, statusing and cleaning up a named container based...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    InternLM-XComposer-2.5

    InternLM-XComposer-2.5

    InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System

    InternLM-XComposer is an open-source multimodal AI system designed to generate long-form content that combines text with visual elements such as images and diagrams. The model is built on top of the InternLM language model architecture and extends its capabilities to handle multimodal inputs and outputs. Instead of producing only textual responses, the system can generate visually enriched documents such as illustrated articles, presentations, and educational materials. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Jekyll Picture Tag

    Jekyll Picture Tag

    Easy responsive images for Jekyll

    My life just got a lot busier; I'd really like a maintainer or two to help. I'm not abandoning JPT, I just don't have a ton of time to put into hacking on it. If you've been learning Ruby and you want to move beyond tutorials and throwaway projects, I'd love to hear from you. I'd be happy to help you gain experience and credibility, if you're willing to help me maintain this project! It's simple to throw a photo on a page and call it a day, but doing justice to users on all different...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Claude Engineer v3

    Claude Engineer v3

    Claude Engineer is an interactive command-line interface (CLI)

    ...It provides both a command-line interface and a modern web interface for coding-related workflows. The project is designed to let Claude identify missing capabilities, generate new tools, load them dynamically, and use them in later conversations. It includes real-time token tracking, conversation state management, tool usage indicators, and structured configuration options. The web interface supports image upload, Markdown rendering, syntax highlighting, and responsive design. Overall, it is an experimental framework for developers who want an AI assistant that can expand its own toolset while helping with software tasks.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    X-AnyLabeling

    X-AnyLabeling

    Effortless data labeling with AI support from Segment Anything

    X-AnyLabeling is an open-source data annotation platform designed to streamline the process of labeling datasets for computer vision and multimodal AI applications. The software integrates an AI-powered labeling engine that allows users to generate annotations automatically with the assistance of modern vision models such as Segment Anything and various object detection frameworks. It supports labeling tasks across images and videos and enables developers to prepare training datasets for...
    Downloads: 102 This Week
    Last Update:
    See Project
  • 16
    Imagen - Pytorch

    Imagen - Pytorch

    Implementation of Imagen, Google's Text-to-Image Neural Network

    Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in Pytorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient unit design. It appears neither CLIP nor prior...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Redoc

    Redoc

    OpenAPI/Swagger-generated API Reference Documentation

    Redoc is an open source tool for generating documentation from OpenAPI (formerly Swagger) definitions. With Redoc, you can generate beautiful API documentation from OpenAPI. Redoc is provided as a CLI tool (also distributed as a Docker image), HTML tag, and React component. If you have Node installed, quickly generate documentation using npx. Open the HTML file in your browser, and your API documentation is shown on the page. Redoc is highly configurable. Redoc offers OpenAPI specification extensions.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    NVIDIA Cosmos

    NVIDIA Cosmos

    NVIDIA Cosmos is an open platform of world models, datasets

    NVIDIA Cosmos is an open platform for building physical AI with world models, datasets, and development tools. It is designed for systems that need to understand, simulate, and generate real-world environments. The project supports robotics, autonomous vehicles, smart infrastructure, video analytics, and other embodied AI use cases. It includes model checkpoints, curated synthetic datasets, evaluation benchmarks, and code for research and deployment. Cosmos 3 expands the platform with omnimodal world models that can work across language, image, video, audio, and action sequences. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    vim-ai

    vim-ai

    AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim

    vim-ai is an AI-powered assistant plugin for Vim and Neovim that brings language-model features directly into the editor. It allows users to generate code or text, edit selections in place, and carry on interactive chat-style conversations without leaving the terminal editing environment. The plugin is built around OpenAI-compatible APIs, which means it can work not only with OpenAI itself but also with compatible proxies and alternative providers. Its command set covers text completion, editing, chat continuation, image generation, and debugging utilities, making it more versatile than a narrow autocomplete add-on. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    EndeavourOS-ISO

    EndeavourOS-ISO

    EndeavourOS ISO framework based on Arch-ISO

    EndeavourOS-ISO is the official installation image build system for EndeavourOS, a community-driven Arch Linux–based distribution that focuses on providing a minimal, flexible, and user-friendly Arch experience. The repository contains the scripts, configurations, and package selections required to generate the live ISO used to install the operating system, allowing developers and contributors to customize installation environments and desktop setups.
    Downloads: 41 This Week
    Last Update:
    See Project
  • 21
    ChatGPT Discord Bot

    ChatGPT Discord Bot

    Integrate ChatGPT into your own discord bot

    Build your own Discord bot using ChatGPT. Using certain personas may generate vulgar or disturbing content. Use at your own risk. Public mode (default), the bot directly replies on the channel. The bot's reply can only be seen by the person who used the command. Invite your bot to your server via OAuth2 URL Generator. Email/Password authentication (Not supported for Google/Microsoft accounts). A system prompt would be invoked when the bot is first started or reset. You can set it up by...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    AI Logo Generator

    AI Logo Generator

    A free + OSS logo generator powered by Flux on Together AI

    AI Logo Generator is an open-source AI logo generator that lets you create professional-looking logos in seconds from a simple text prompt. It uses the Flux Pro 1.1 model hosted on Together AI to generate logos, so the heavy lifting is done by a state-of-the-art image model while the app focuses on UX and workflow. The project is built with Next.js and TypeScript, and it uses shadcn/ui plus Tailwind CSS for a modern, responsive interface that feels like a polished SaaS product rather than a demo. It integrates Clerk for authentication so users can sign in, save their logo history (planned via a dashboard), and potentially manage usage tied to their own API key. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    HivisionIDPhoto

    HivisionIDPhoto

    HivisionIDPhotos: a lightweight and efficient AI ID photos tools

    ...It also allows the generation of layout sheets such as six-inch photo arrangements for printing multiple ID photos on a single page. The project focuses on building a practical pipeline for automated ID photo production using AI-based segmentation and image processing techniques.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    Snappy PDF

    Snappy PDF

    A ServiceProvider for Snappy

    Laravel Snappy is a Laravel wrapper around the Snappy PDF/Image library, which itself is powered by wkhtmltopdf and wkhtmltoimage, allowing you to generate PDFs and images directly from HTML. It lets you take a Blade view, raw HTML string, or file and turn it into a downloadable, savable, or in-browser PDF/image response with just a few lines of code. The package integrates cleanly with the Laravel service container and offers a simple facade/API so you can quickly configure page size, orientation, margins, and other options. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    draw-a-ui

    draw-a-ui

    Draw wireframe sketches and generate HTML with AI vision models

    draw-a-ui is an experimental open source application that converts hand-drawn interface wireframes into working HTML code using artificial intelligence. draw-a-ui combines the tldraw canvas drawing tool with a vision-capable language model to interpret user-created mockups and translate them into a single HTML layout styled with Tailwind CSS. When a user sketches a UI on the canvas, the application captures the current drawing as SVG, converts it into a PNG image, and sends that image to a...
    Downloads: 3 This Week
    Last Update:
    See Project
Auth0 Logo