Showing 10115 open source projects for "visual"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 1
    Playwright MCP

    Playwright MCP

    Playwright MCP server

    An MCP server developed by Microsoft that offers browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots without relying on visual data. ​
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    SQLTools

    SQLTools

    Database management for VSCode

    VSCode-SQLTools is a Visual Studio Code extension that enhances database management and development. It provides a rich set of features for connecting to databases, executing queries, and managing data directly within the code editor.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    Inkeep

    Inkeep

    Create AI Agents in a No-Code Visual Builder or TypeScript SDK

    Inkeep is an open-source framework for building and deploying AI agent workflows and interactive assistants that operate autonomously across applications, enterprise environments, and customer engagement use cases. It lets developers and non-technical users create, manage, and orchestrate multi-agent systems using both a no-code visual builder and a full TypeScript SDK, giving two ways to define agent behaviors that stay in sync with each other. Agents built with this framework can act as real-time conversational assistants — for example, handling help desk inquiries, providing internal support to teams, or driving in-app experiences — and they can be extended to automate multi-step tasks that interact with external systems like CRMs, knowledge bases, or ticketing systems. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    WeChatMsg

    WeChatMsg

    Project aimed at extracting, exporting, and analyzing chat records

    ...It provides tools that read local WeChat database files and allow users to convert chat data into readable formats such as HTML, Word, and CSV, making it possible to inspect conversations outside the mobile app environment. Beyond simple export, the project includes mechanisms for analyzing chat histories and generating annual reports or visual summaries about messaging trends, interaction patterns, and more. The original README communicates a guiding philosophy about owning personal data and using it responsibly to train personalized AI agents or preserve memories. Although the repository has seen periods of inactivity and may not receive frequent updates, its widespread use indicates community interest in preserving chat logs and understanding conversation data outside of the WeChat interface.
    Downloads: 193 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    clangd

    clangd

    clangd language server

    clangd understands your C++ code and adds smart features to your editor: code completion, compile errors, definition, and more. clangd is a language server that can work with many editors via a plugin. Here’s Visual Studio Code with the clangd plugin, demonstrating code completion.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 6
    Qwen-Image-Layered

    Qwen-Image-Layered

    Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

    Qwen-Image-Layered is an extension of the Qwen series of multimodal models that introduces layered image understanding, enabling the model to reason about hierarchical visual structures — such as separating foreground, background, objects, and contextual layers within an image. This architecture allows richer semantic interpretation, enabling use cases such as scene decomposition, object-level editing, layered captioning, and more fine-grained multimodal reasoning than with flat image encodings alone. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 7
    Xamarin.iOS & Xamarin.Mac

    Xamarin.iOS & Xamarin.Mac

    Bridges the worlds of .NET with the native APIs of macOS, iOS, tvOS

    ...These SDKs allow us to create native iOS, tvOS, watchOS and macOS applications using the same UI controls we would in Objective-C and Xcode, except with the flexibility and elegance of a modern language (C#), the power of the .NET Base Class Library (BCL), and two first-class IDEs—Visual Studio for Mac and Visual Studio—at our fingertips. This repository is where we do development for the Xamarin.iOS and Xamarin.Mac SDKs. The preferred method for installing Xamarin.iOS and Mac is to use the Visual Studio installers (Windows, Mac). The team also strongly recommends using the latest Xamarin SDK and Xcode whenever possible.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    i.am.ai

    i.am.ai

    Roadmap to becoming an Artificial Intelligence Expert in 2022

    i.am.ai is a structured educational guide that maps out the knowledge areas and technologies required to become an artificial intelligence or machine learning expert. The project presents visual charts that outline multiple career paths such as data scientist, machine learning engineer, and AI specialist, helping learners understand what to study and in what order. It was originally created to train internal employees but was released publicly to support the broader community. The roadmap emphasizes foundational skills like mathematics, programming, and data handling before progressing into deep learning and specialized domains. ...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 9
    Orchis theme

    Orchis theme

    Material Design theme for GNOME/GTK based desktop environments

    ...Installation scripts make it easy to apply the theme across supported environments and also handle compatibility with different GTK versions. Because it follows Material Design principles, Orchis strikes a balance between simplicity and visual depth, giving interfaces a fresh and contemporary appeal.
    Downloads: 19 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    DeepSeek VL

    DeepSeek VL

    Towards Real-World Vision-Language Understanding

    DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository includes model weights (or pointers to them), evaluation metrics on standard vision + language benchmarks, and configuration or architecture files. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Flyde

    Flyde

    Visual programming for developers. Includes a VS Code extension

    ...It supports everything you'd expect from a textual-based language; recursions, loops, and conditionals. Unlock new ways to think about, and debug your programs. Have an always-correct diagram of your program. Flyde’s flow-based, visual approach makes asynchronous and concurrent tasks simpler and more intuitive. In Flyde, nothing is hidden. From node implementation to runtime code. Easily access the code behind each node in Flyde’s standard library. Everything needed to run Flyde flows is open source and MIT-licensed. Create Flyde flows right from your IDE. Wrap existing TypeScript code in a Flyde node and use it in your flows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Dafthunk

    Dafthunk

    A workflow execution platform built on top of the fantastic Cloudflare

    ...It aims to combine the approachability of a visual editor with the practical needs of real automation: state persistence, execution history, reusable nodes, and integrations with external systems. A key appeal is that you can go from idea to running automation quickly in a hosted-like experience while still keeping the project open source and infrastructure-aware.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Universal Windows Platform app samples

    Universal Windows Platform app samples

    API samples for the Universal Windows Platform.

    This repo contains the samples that demonstrate the API usage patterns for the Universal Windows Platform (UWP) in the Windows Software Development Kit (SDK) for Windows 10. These code samples were created with the Universal Windows Platform templates available in Visual Studio, and are designed to run on desktop, mobile, and future devices that support the Universal Windows Platform. If you are unfamiliar with Git and GitHub, you can download the entire collection as a ZIP file, but be sure to unzip everything to access shared dependencies. These samples require Visual Studio and the Windows Software Development Kit (SDK) for Windows 10. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 14
    Qwen3-VL

    Qwen3-VL

    Qwen3-VL, the multimodal large language model series by Alibaba Cloud

    Qwen3-VL is the latest multimodal large language model series from Alibaba Cloud’s Qwen team, designed to integrate advanced vision and language understanding. It represents a major upgrade in the Qwen lineup, with stronger text generation, deeper visual reasoning, and expanded multimodal comprehension. The model supports dense and Mixture-of-Experts (MoE) architectures, making it scalable from edge devices to cloud deployments, and is available in both instruction-tuned and reasoning-enhanced variants. Qwen3-VL is built for complex tasks such as GUI automation, multimodal coding (converting images or videos into HTML, CSS, JS, or Draw.io diagrams), long-context reasoning with support up to 1M tokens, and comprehensive video understanding. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 15
    MangoHud

    MangoHud

    A Vulkan and OpenGL overlay for monitoring FPS, temperatures, CPU/GPU

    A Vulkan and OpenGL overlay for monitoring FPS, temperatures, CPU/GPU load and more. mangoplot is a plotting script that is shipped with MangoHud: on a given folder, it takes each log file, makes a 1D heatmap of its framerates, then stacks the heat maps vertically to form a 2D graph for easy visual comparison between benchmarks.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 16
    AWTRIX 3

    AWTRIX 3

    Custom firmware for the Ulanzi Smart Pixel clock or self made awtrix

    ...AWTRIX 3 comes pre-installed with apps that display time, date, temperature, and humidity, but advanced users can expand its functionality by creating custom apps and sending notifications via MQTT or HTTP protocols. The firmware is highly customizable, allowing users to manage icons, create animations, and implement various visual effects on the clock's display.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 17
    Google Snake Mod Loader

    Google Snake Mod Loader

    All Google Snake mods combined into one

    Google Snake Mod Loader is an open-source browser-based modification framework that extends the classic Google Snake game by enabling custom mods, gameplay enhancements, and visual changes directly within the web version of the game. It works by injecting JavaScript into the Google Snake page, allowing users to dynamically alter game behavior without needing to download or modify the original source code. The loader provides a centralized system for managing multiple mods, making it easy to toggle features such as new mechanics, custom maps, altered physics, or additional UI elements. ...
    Downloads: 159 This Week
    Last Update:
    See Project
  • 18
    ComfyUI-3D-Pack

    ComfyUI-3D-Pack

    An extensive node suite that enables ComfyUI to process 3D inputs

    ...It incorporates modern 3D generation technologies including neural radiance fields, Gaussian splatting, and other AI-driven reconstruction techniques. Through these nodes, users can convert images into 3D models, manipulate geometry, and experiment with generative 3D workflows inside the visual pipeline editor.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 19
    diagrams.net

    diagrams.net

    Security-first diagramming for teams

    ...Office 365 app for Word, Powerpoint and Excel. Highest-rated Confluence app in the Atlassian Marketplace. Works in Jira too. Works with GitHub, GitLab and Dropbox for visual documentation in distributed teams. You can download diagrams.net Desktop for offline use on Windows, macOS, Linux and ChromeOS. Nextcloud is an ideal on-premise file hosting system for diagrams.net.
    Downloads: 120 This Week
    Last Update:
    See Project
  • 20
    drawio-desktop

    drawio-desktop

    Security-first diagramming for teams

    ...Office 365 app for Word, Powerpoint and Excel. Highest-rated Confluence app in the Atlassian Marketplace. Works in Jira too. Works with GitHub, GitLab and Dropbox for visual documentation in distributed teams. Download diagrams.net Desktop for offline use on Windows, macOS, Linux and ChromeOS. Nextcloud is an ideal on-premise file hosting system for diagrams.net. Entity relationship diagrams or ER models in software engineering show the structure of and relationships between database objects. They are used extensively in database modelling to plan new systems, and document existing systems for maintenance and updates.
    Downloads: 153 This Week
    Last Update:
    See Project
  • 21
    LTX-2.3

    LTX-2.3

    Official Python inference and LoRA trainer package

    ...Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. This unified approach allows creators to generate complete multimedia sequences where motion, timing, and sound are aligned automatically. LTX-2 is designed for both research and production workflows and can generate high-resolution video clips with precise control over structure, motion, and camera behavior.
    Downloads: 186 This Week
    Last Update:
    See Project
  • 22
    Unredact

    Unredact

    A simple tool for reading in poorly redacted documents

    ...Unlike traditional optical character recognition (OCR), which only reads visible text, Unredact focuses on inferring missing content where redaction has been applied by analyzing surrounding context, font characteristics, and linguistic patterns to produce candidate reconstructions. It accepts a variety of input formats, automatically identifies redacted regions, and then generates text suggestions that are presented alongside visual overlays so users can choose or refine outputs.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 23
    LLM Vision

    LLM Vision

    Visual intelligence for your home.

    ...The project enables Home Assistant to analyze images, video files, and live camera feeds using vision-capable AI models. Instead of relying only on traditional object detection pipelines, it allows users to send prompts about visual content and receive contextual descriptions or answers about what is happening in camera footage. The system can process events from surveillance platforms such as Frigate and convert them into meaningful summaries, notifications, or structured data for automation workflows. It also maintains a timeline of analyzed camera events that can be displayed in dashboards or queried through the assistant interface.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    OpenClaw

    OpenClaw

    Your own personal AI assistant. Any OS. Any Platform.

    OpenClaw (formerly Clawdbot/Moltbot) is an open-source, self-hosted autonomous AI assistant designed to run on user-controlled hardware and bridge conversational natural language with real-world task execution, effectively acting as a proactive digital assistant rather than a reactive chatbot. It lets you send instructions through familiar messaging platforms like WhatsApp, Telegram, Discord, Slack, Signal, iMessage, and more, and then interprets those instructions to carry out actions such...
    Downloads: 443 This Week
    Last Update:
    See Project
  • 25
    COBE

    COBE

    5KB WebGL globe lib

    ...The library focuses on simplicity, allowing developers to create animated and interactive globe experiences with just a few lines of code. It supports smooth animations, rotation, and customizable visual styles, making it suitable for dashboards, landing pages, and data visualization tools. The rendering is optimized to be efficient and responsive, even on lower-powered devices. It also provides configuration options for lighting, markers, and textures, enabling a wide range of visual effects. Overall, cobe offers an accessible way to integrate 3D globe visuals into modern web interfaces.
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB