Showing 9193 open source projects for "visual"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    AIMr

    AIMr

    The best AI Aimbot for Fortnite, Valorant, CS2, R6, COD, Apex, & more

    ...The software includes various aiming enhancements, such as recoil control, silent aim, and prediction capabilities, aimed at making gameplay smoother and more competitive. AIMr also provides visual customization options like field-of-view displays and detection indicators, allowing players to tailor their interface. The system is compatible with games that use human-shaped models, and although it functions effectively out of the box, optimizing it with CUDA-accelerated OpenCV is recommended for maximum performance.
    Downloads: 289 This Week
    Last Update:
    See Project
  • 2
    Dify

    Dify

    One API for plugins and datasets, one interface for prompt engineering

    Dify is an easy-to-use LLMOps platform designed to empower more people to create sustainable, AI-native applications. With visual orchestration for various application types, Dify offers out-of-the-box, ready-to-use applications that can also serve as Backend-as-a-Service APIs. Unify your development process with one API for plugins and datasets integration, and streamline your operations using a single interface for prompt engineering, visual analytics, and continuous improvement. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    Digital-Logic-Sim

    Digital-Logic-Sim

    A minimalistic digital logic simulator

    Digital-Logic-Sim is an interactive simulation tool designed to help users understand digital logic circuits through a visual and hands-on environment. It allows users to build circuits using basic logic gates such as AND, OR, and NOT, and observe how signals propagate through the system in real time. The project emphasizes education, making it easier for beginners to grasp concepts like binary logic, combinational circuits, and signal flow. It provides a visual interface where components can be connected and tested dynamically, reinforcing learning through experimentation. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    lua-language-server

    lua-language-server

    A language server that offers Lua language support - programmed in Lua

    The Lua language server provides various language features for Lua to make development easier and faster. With around half a million installs on Visual Studio Code, it is the most popular extension for Lua language support. The language server can be installed for use in Visual Studio Code, NeoVim, and any other clients that support the Language Server Protocol. The language server can be configured using a configuration file. The language server and Visual Studio Code client can be installed from the VS Code Marketplace. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 5
    GalTransl

    GalTransl

    Automated translation solution for visual novels

    GalTransl is an automated translation system specifically designed for visual novels, particularly those in the “galgame” genre, leveraging large language models to streamline and enhance the translation process. It integrates support for multiple advanced LLM providers such as GPT-4, Claude, DeepSeek, and other models, enabling high-quality, context-aware translations that go beyond traditional machine translation approaches.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    ShiftMediaProject FFmpeg

    ShiftMediaProject FFmpeg

    Unofficial FFmpeg with added custom native Visual Studio project

    ShiftMediaProject FFmpeg is a specialized fork of FFmpeg tailored for building Windows-compatible binaries with additional support for third-party libraries and toolchains. It provides patches and build scripts that enable easier integration of FFmpeg into Windows development environments, particularly with Visual Studio. The project focuses on improving compatibility with external codecs and libraries that are not always supported in standard builds. It includes modifications that simplify compilation and linking processes, reducing the complexity of setting up FFmpeg on Windows systems. The fork is often used by developers who need customized FFmpeg builds with extended functionality. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Rackula

    Rackula

    Drag and drop rack visualizer

    Rackula is a browser-based rack layout designer aimed at homelabbers, audio/video technicians, and equipment organizers who want a visual way to plan and document physical device racks. It runs entirely client-side with no backend server required, making it lightweight, fast, and easy to self-host or run locally without external dependencies. Users can drag and drop devices into customizable rack spaces, annotate equipment, set unit sizes, and manage complex layouts as their setup evolves. ...
    Downloads: 38 This Week
    Last Update:
    See Project
  • 8
    builder.io

    builder.io

    Visual Development for React, Vue, Svelte, Qwik, and more

    Builder is an open-source visual development platform and headless content management system designed to help teams build and manage digital experiences more efficiently. The platform combines a drag-and-drop visual editor with an API-driven content system that can deliver structured content to any front-end framework or application. Builder allows developers to integrate their existing components and design systems while enabling designers and marketers to modify layouts and publish content without writing code. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    ViMax

    ViMax

    Director, Screenwriter, Producer, and Video Generator All-in-One

    ViMax is an open-source framework for performing large-scale multi-modal vision-language modeling and reasoning by combining powerful image encoders with advanced language models to solve complex visual tasks. It integrates components like visual encoders, cross-modal fusion techniques, and reasoning modules so that users can go beyond simple captioning or classification to perform tasks such as visual question answering, multi-image inference, and structured scene understanding. ViMax’s design accommodates large image sets and supports retrieval augmentation, enabling it to work with external image databases, supplementary metadata, and semantic search to enhance context awareness. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    SQLTools

    SQLTools

    Database management for VSCode

    VSCode-SQLTools is a Visual Studio Code extension that enhances database management and development. It provides a rich set of features for connecting to databases, executing queries, and managing data directly within the code editor.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    VGit

    VGit

    Visual git plugin for Neovim

    Visual Git Plugin for Neovim to enhance your git experience.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    Playwright MCP

    Playwright MCP

    Playwright MCP server

    An MCP server developed by Microsoft that offers browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots without relying on visual data. ​
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Clash Verge

    Clash Verge

    A Clash Meta GUI based on Tauri

    Clash Verge Rev is an open-source desktop application that provides a modern graphical interface for managing network proxy configurations and routing internet traffic through rule-based tunneling systems. Built using the Tauri framework and Rust, it delivers a lightweight yet high-performance experience across Windows, macOS, and Linux platforms. The application is based on the Mihomo (formerly Clash Meta) core, which enables advanced proxy routing capabilities, allowing users to define...
    Downloads: 369 This Week
    Last Update:
    See Project
  • 14
    yt-dlp-gui

    yt-dlp-gui

    A cross-platform GUI wrapper for yt-dlp written in PySide6

    yt-dlp-gui is a cross-platform graphical interface for the popular command-line video downloader yt-dlp, created to make video and audio downloads from sites like YouTube, Vimeo, Twitch, and others easier for everyday users without needing to work directly with command-line arguments. Written in PySide6 (Python with Qt bindings), it wraps the powerful yt-dlp engine in a visual application that lets users paste video URLs, choose formats, apply presets, and start downloads with a click, while still exposing options for advanced tweaks via configuration files. The project supports preset definitions and global arguments through a config file, so users can customize their most common download workflows—like audio extraction, quality ranking, or embedding thumbnails—without retyping arguments each time. ...
    Downloads: 295 This Week
    Last Update:
    See Project
  • 15
    WiFi DensePose

    WiFi DensePose

    Turn WiFi signals into real-time human pose estimation and detection

    ...The repository includes components for data processing, model inference, and real-time visualization, making it suitable for research and experimental deployments. Its architecture emphasizes performance and reproducibility, allowing developers to explore non-visual motion capture systems using accessible hardware. Overall, WiFi DensePose functions as an advanced research-grade toolkit for WiFi-based human sensing and pose estimation.
    Downloads: 233 This Week
    Last Update:
    See Project
  • 16
    R1-V

    R1-V

    Witness the aha moment of VLM with less than $3

    R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Pokémon: Evolved

    Pokémon: Evolved

    All of the original 151 pokemon now have longer evolutions lines

    ...The project modifies all original Generation I Pokémon to include additional evolution stages, increasing the total number of obtainable creatures and creating a more complex progression system. It introduces custom sprites, new Pokédex entries, and shiny variants for the added evolutions, enhancing both visual and gameplay variety. Core mechanics such as stats, movesets, and evolution levels have been rebalanced to provide a more challenging and engaging experience. The hack also ensures that all Pokémon can be obtained within a single playthrough by adjusting encounter rates and availability across routes. Additional quality-of-life changes, such as infinite rare candies and simplified evolution requirements, streamline progression while maintaining difficulty.
    Downloads: 173 This Week
    Last Update:
    See Project
  • 18
    Skywork-R1V4

    Skywork-R1V4

    Skywork-R1V is an advanced multimodal AI model series

    Skywork-R1V is an open-source multimodal reasoning model designed to extend the capabilities of large language models into vision-language tasks that require complex logical reasoning. The project introduces a model architecture that transfers the reasoning abilities of advanced text-based models into visual domains so the system can interpret images and perform multi-step reasoning about them. Instead of retraining both language and vision models from scratch, the framework uses a lightweight visual projection layer that connects a pretrained vision backbone with a reasoning-capable language model. This design allows the model to analyze images while maintaining strong textual reasoning performance, enabling tasks such as solving visual math problems, interpreting scientific diagrams, and answering questions about images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Clash Party

    Clash Party

    Another Mihomo GUI

    ...It includes intelligent routing features, including AI-assisted node selection and rule-based traffic management, which help optimize latency and connection reliability. Clash Party also emphasizes usability with a modern interface, customizable themes, and visual tools for monitoring connections and traffic activity. It supports configuration backup and synchronization, making it easier to manage complex setups across devices.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 20
    RobotCode

    RobotCode

    RobotFramework support for Visual Studio Code

    An extension that brings support for RobotFramework to Visual Studio Code, including features like code completion, debugging, test explorer, refactoring and more! With RobotCode you can edit your code with auto-completion, code navigation, syntax checking and many more.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    Clash Verge Rev

    Clash Verge Rev

    A modern GUI client based on Tauri

    Clash Verge Rev is an open-source desktop application that provides a modern graphical interface for managing network proxy configurations and routing internet traffic through rule-based tunneling systems. Built using the Tauri framework and Rust, it delivers a lightweight yet high-performance experience across Windows, macOS, and Linux platforms. The application is based on the Mihomo (formerly Clash Meta) core, which enables advanced proxy routing capabilities, allowing users to define...
    Downloads: 290 This Week
    Last Update:
    See Project
  • 22
    SAM 3

    SAM 3

    Code for running inference and finetuning with SAM 3 model

    SAM 3 (Segment Anything Model 3) is a unified foundation model for promptable segmentation in both images and videos, capable of detecting, segmenting, and tracking objects. It accepts both text prompts (open-vocabulary concepts like “red car” or “goalkeeper in white”) and visual prompts (points, boxes, masks) and returns high-quality masks, boxes, and scores for the requested concepts. Compared with SAM 2, SAM 3 introduces the ability to exhaustively segment all instances of an open-vocabulary concept specified by a short phrase or exemplars, scaling to a vastly larger set of categories than traditional closed-set models. ...
    Downloads: 35 This Week
    Last Update:
    See Project
  • 23
    Clarity AI Upscaler

    Clarity AI Upscaler

    AI Image Upscaler & Enhancer

    Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 24
    GLM-Image

    GLM-Image

    GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image

    ...It excels at generating images that include complex layouts and detailed text content, making it especially useful for posters, diagrams, info-graphics, social media graphics, and visual content that requires precise text placement and semantic alignment. Because it blends linguistic reasoning with image synthesis, GLM-Image produces visual outputs where semantic relationships and textual accuracy are prioritized alongside artistic style and realism, and its model structure enables it to handle dense visual knowledge tasks that challenge many pure diffusion models. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 25
    Qwen-Image-Layered

    Qwen-Image-Layered

    Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

    Qwen-Image-Layered is an extension of the Qwen series of multimodal models that introduces layered image understanding, enabling the model to reason about hierarchical visual structures — such as separating foreground, background, objects, and contextual layers within an image. This architecture allows richer semantic interpretation, enabling use cases such as scene decomposition, object-level editing, layered captioning, and more fine-grained multimodal reasoning than with flat image encodings alone. ...
    Downloads: 8 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB