Showing 6 open source projects for "generate image"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    ComfyUI-HunyuanVideoWrapper

    ComfyUI-HunyuanVideoWrapper

    ComfyUI wrapper nodes for HunyuanVideo

    The ComfyUI-HunyuanVideoWrapper project is a ComfyUI extension that integrates Hunyuan-based multimodal video generation models into node-based workflows. It allows users to generate or manipulate video content by combining text prompts with one or more input images, enabling flexible conditioning of outputs. The system introduces specialized nodes such as text-image encoders that allow multiple image inputs to be referenced directly within prompts. This makes it possible to guide generation using both visual and textual context simultaneously. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    infinite-canvas

    infinite-canvas

    Infinite Canvas Workbench for AI creation integrates AI generation

    ...The project supports OpenAI-compatible API connections for text-to-image, image-to-image, reference editing, text chat, audio generation, and video generation. It also includes a canvas assistant that can discuss selected nodes, use upstream context, generate new outputs, and place results back onto the canvas. The project is still in active development and is better suited for personal or local deployment than stable public multi-user production use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    CowAgent

    CowAgent

    AI assistant based on large models that can actively think and plan

    CowAgent, based on the chatgpt-on-wechat project, is an open-source AI agent framework that integrates large language models into the WeChat ecosystem to create intelligent conversational assistants. It enables automated message handling by connecting WeChat accounts with AI models that can generate contextual replies, process voice messages, and produce images directly inside chats. The platform has evolved beyond a simple chatbot into a more autonomous agent capable of planning complex...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Palmier Pro

    Palmier Pro

    macOS video editor built for AI

    Palmier Pro is an open-source video editor for Mac built around AI-assisted video creation. It lets users and coding agents work together directly inside a timeline, blending traditional editing with generative workflows. The app is written from scratch in Swift and takes inspiration from professional editors like Premiere Pro while rethinking the workflow around AI. Users can generate videos and images inside the editor with models such as Seedance, Kling, and Nano Banana Pro. It also...
    Downloads: 4 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 5
    Gemini CLI

    Gemini CLI

    Open source AI agent CLI tool to bring Gemini into your terminal

    Gemini CLI is an open‑source AI agent that brings the capabilities of Google’s Gemini 2.5 Pro large‑language model directly into your terminal, enabling tasks ranging from coding and debugging to content creation and research via natural‑language prompts, with support for multimodal outputs like image and video generation. Gemini CLI integrates with external tools and MCP servers, enabling media generation and enhanced workflow automation. It also includes a built-in Google Search tool to...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 6
    Vane

    Vane

    Vane is an AI-powered answering engine

    ...The platform supports both local LLMs through Ollama and cloud providers such as OpenAI, Claude, Gemini, and Groq, giving users flexibility in how queries are processed. It integrates web search through SearxNG while also supporting discussions, academic sources, image search, and video search to generate citation-backed responses. Vane includes multiple search modes optimized for speed, balanced usage, or deep research depending on the complexity of the query. Its architecture emphasizes modular orchestration, custom provider systems, streaming responses, and widget-based UI enhancements for calculations, weather, and contextual data. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo