Showing 12 open source projects for "web image editor"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    vim-ai

    vim-ai

    AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim

    ...Its command set covers text completion, editing, chat continuation, image generation, and debugging utilities, making it more versatile than a narrow autocomplete add-on. The repository also highlights support for custom roles, vision features such as image-to-text, and an emerging provider-plugin model for extending compatibility further. A notable design point is that it only sends content the user explicitly selects or includes in prompts, which helps users control what is shared with the external model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenKnowledge

    OpenKnowledge

    Beautiful, AI-native markdown editor and LLM Wiki

    OpenKnowledge is an AI-native Markdown editor and LLM wiki for knowledge bases, specs, notes, and agent-friendly documentation. It is designed to make Markdown editing feel closer to a visual document editor while still preserving file-based workflows. The app supports a macOS desktop experience as well as a local web and CLI workflow for other platforms. It includes file navigation, search, tabs, wiki graph viewing, rich components, embeddable HTML, and terminal-oriented access. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    SD.Next

    SD.Next

    All-in-one WebUI for AI generative image and video creation

    SD.Next is an all-in-one web user interface for generative image creation that expands beyond basic Stable Diffusion workflows to cover broader image and video generation, captioning, and processing tasks. It is designed as a power-user environment where model management, generation features, and workflow controls are centralized in a single UI rather than spread across separate scripts and utilities.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    ChatJS

    ChatJS

    Production-ready AI chat. Start here and make it your own

    ...The project presents itself as a starting point that already includes authentication, model access, streaming, and tool support so teams can focus on their own product-specific features. Its stack is centered on modern TypeScript web development, using technologies such as Next.js, AI SDK tooling, PostgreSQL, Redis, and type-safe configuration patterns. The repository highlights access to more than 120 models through a unified interface, covering providers such as Claude, GPT, Gemini, and Grok. It also supports practical product features like attachments, resumable streams, branching conversations, sharing, web search, image generation, code execution, and MCP support.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 5
    LocalAI

    LocalAI

    The free, Open Source alternative to OpenAI, Claude and others

    LocalAI is an open-source platform that allows users to run large language models and other AI systems locally on their own hardware. It acts as a drop-in replacement for APIs such as OpenAI, enabling developers to build AI-powered applications without relying on external cloud services. The platform supports a wide range of model types, including text generation, image creation, speech processing, and embeddings. LocalAI can run on consumer-grade hardware and does not necessarily require a...
    Downloads: 37 This Week
    Last Update:
    See Project
  • 6
    VisualGLM-6B

    VisualGLM-6B

    Chinese and English multimodal conversational language model

    ...It builds on the ChatGLM-6B backbone, with 6.2 billion language parameters, and incorporates a BLIP2-Qformer visual module to connect vision and language. In total, the model has 7.8 billion parameters. Trained on a large bilingual dataset — including 30 million high-quality Chinese image-text pairs from CogView and 300 million English pairs — VisualGLM-6B is designed for image understanding, description, and question answering. Fine-tuning on long visual QA datasets further aligns the model’s responses with human preferences. The repository provides inference APIs, command-line demos, web demos, and efficient fine-tuning options like LoRA, QLoRA, and P-tuning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Qwen2.5-Omni

    Qwen2.5-Omni

    Capable of understanding text, audio, vision, video

    Qwen2.5-Omni is an end-to-end multimodal flagship model in the Qwen series by Alibaba Cloud, designed to process multiple modalities (text, images, audio, video) and generate responses both as text and natural speech in streaming real-time. It supports “Thinker-Talker” architecture, and introduces innovations for aligning modalities over time (for example synchronizing video/audio), robust speech generation, and low-VRAM/quantized versions to make usage more accessible. It holds...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Rocketnotes

    Rocketnotes

    AI-powered markdown editor - leverage LLMs with your documents

    RocketNotes is an open-source note-taking application designed to combine traditional knowledge management with artificial intelligence features that enhance how users capture and organize information. The project focuses on providing a fast, lightweight environment where users can create structured notes, manage personal knowledge bases, and interact with AI tools to summarize or expand their content. Instead of functioning purely as a document editor, RocketNotes integrates AI capabilities...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Chat UI

    Chat UI

    The open source codebase powering HuggingChat

    Hugging Face Chat UI is an open-source web interface designed for interacting with large language models through a modern conversational interface. The project serves as the codebase behind HuggingChat and can be deployed locally or on cloud infrastructure to create customizable AI chat applications. Built with modern web technologies such as SvelteKit and backed by MongoDB for persistence, the interface provides a responsive environment for multi-turn conversations, file handling, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • $300 Free Credits to Build on Google Cloud Icon
    $300 Free Credits to Build on Google Cloud

    New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.
    Claim $300 Free
  • 10
    Qwen3-Omni

    Qwen3-Omni

    Qwen3-omni is a natively end-to-end, omni-modal LLM

    Qwen3-Omni is a natively end-to-end multilingual omni-modal foundation model that processes text, images, audio, and video and delivers real-time streaming responses in text and natural speech. It uses a Thinker-Talker architecture with a Mixture-of-Experts (MoE) design, early text-first pretraining, and mixed multimodal training to support strong performance across all modalities without sacrificing text or image quality. The model supports 119 text languages, 19 speech input languages, and...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Anything to NotebookLM

    Anything to NotebookLM

    Multi-source content processor for NotebookLM

    Qiaomu Anything to NotebookLM is a Claude Code skill that turns many types of source material into structured NotebookLM-ready outputs. It is built for users who want to convert articles, web pages, videos, PDFs, office files, podcasts, images, and search results into more usable study or presentation formats. The project uses natural-language commands, so the user can ask for a podcast, slide deck, mind map, report, quiz, flashcards, or infographic without manually building the workflow. It...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CogVLM

    CogVLM

    A state-of-the-art open visual language model

    CogVLM is an open-source visual–language model suite—and its GUI-oriented sibling CogAgent—aimed at image understanding, grounding, and multi-turn dialogue, with optional agent actions on real UI screenshots. The flagship CogVLM-17B combines ~10B visual parameters with ~7B language parameters and supports 490×490 inputs; CogAgent-18B extends this to 1120×1120 and adds plan/next-action outputs plus grounded operation coordinates for GUI tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
Auth0 Logo