Showing 48 open source projects for "visual"

  • 1
    Clarity AI Upscaler

    AI Image Upscaler & Enhancer

    Clarity AI Upscaler is an open-source AI image enhancement tool designed to increase the resolution and visual quality of images using modern generative techniques. The system uses deep learning models based on diffusion and other image generation methods to reconstruct high-resolution versions of low-resolution images while preserving important visual details. Unlike traditional interpolation-based upscaling algorithms, the system generates additional visual information that improves perceived clarity and sharpness. ...
    Downloads: 13 This Week
  • 2
    yt-dlp-gui

    A cross-platform GUI wrapper for yt-dlp written in PySide6

    yt-dlp-gui is a cross-platform graphical interface for the popular command-line video downloader yt-dlp, created to make video and audio downloads from sites like YouTube, Vimeo, Twitch, and others easier for everyday users without needing to work directly with command-line arguments. Written in PySide6 (Python with Qt bindings), it wraps the powerful yt-dlp engine in a visual application that lets users paste video URLs, choose formats, apply presets, and start downloads with a click, while still exposing options for advanced tweaks via configuration files. The project supports preset definitions and global arguments through a config file, so users can customize their most common download workflows (like audio extraction, quality ranking, or embedding thumbnails) without retyping arguments each time. ...
    Downloads: 198 This Week
  • 3
    SeedVR2 Upscaler ComfyUI

    Official SeedVR2 Video Upscaler for ComfyUI

    ComfyUI-SeedVR2 Video Upscaler is an open-source integration node for the ComfyUI workflow environment that brings the advanced SeedVR2 video upscaling and restoration model directly into visual AI pipelines. This project packages the SeedVR2 architecture as a custom node for ComfyUI, letting users upscale low-resolution video or imagery inside a node-based interface without needing to write code manually. The underlying SeedVR2 model is known for delivering high-quality video enhancement with strong temporal consistency and improved detail preservation by using diffusion-based techniques that are trained specifically on video sequences. ...
    Downloads: 19 This Week
  • 4
    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.
    Downloads: 2 This Week
  • 5
    armory

    3D Engine with Blender Integration

    ...Powered by the Armory engine, ArmorPaint is stand-alone software designed for physically-based texture painting. Drag & drop your 3D models and start painting. Receive instant visual feedback in the viewport as you paint. Powered by the Armory engine, ArmorLab is stand-alone software designed for AI-powered texture authoring. Generate PBR materials by dragging and dropping your photos. In development! Armory is an open-source 3D game engine with full Blender integration. The engine is currently available as an early preview.
    Downloads: 11 This Week
  • 6
    PersonaLive

    Expressive Portrait Image Animation for Live Streaming

    ...The framework prioritizes low-latency and streamable output, making it suitable for real-time creative workflows, broadcast overlays, or interactive avatars on consumer-grade GPUs. PersonaLive’s architecture balances visual quality and efficiency by combining motion encoding, temporal modules, and hybrid implicit control signals to preserve identity and stable expression through long sequences.
    Downloads: 4 This Week
  • 7
    VGGSfM

    VGGSfM: Visual Geometry Grounded Deep Structure From Motion

    VGGSfM is an advanced structure-from-motion (SfM) framework jointly developed by Meta AI Research (GenAI) and the University of Oxford’s Visual Geometry Group (VGG). It reconstructs 3D geometry, dense depth, and camera poses directly from unordered or sequential images and videos. The system combines learned feature matching and geometric optimization to generate high-quality camera calibrations, sparse/dense point clouds, and depth maps in standard COLMAP format. Version 2.0 adds support for dynamic scene handling, dense point cloud export, video-based reconstruction (1000+ frames), and integration with Gaussian Splatting pipelines. ...
    Downloads: 0 This Week
  • 8
    PyVista

    3D plotting and mesh analysis through a streamlined interface

    ...PyVista is a helper module for the Visualization Toolkit (VTK) that takes a different approach to interfacing with VTK through NumPy and direct array access. This package provides a Pythonic, well-documented interface exposing VTK’s powerful visualization backend to facilitate rapid prototyping, analysis, and visual integration of spatially referenced datasets. This module can be used for scientific plotting for presentations and research papers as well as a supporting module for other mesh-dependent Python modules. Easily integrate with NumPy and create a variety of geometries and plot them. You could use any geometry to create your glyphs, or even plot the points directly. ...
    Downloads: 1 This Week
  • 9
    SteadyDancer

    Harmonized and Coherent Human Image Animation

    ...The system can be used both in preprocessing pipelines for content creators and in live feedback loops for performers, giving dancers and videographers a tool to refine their visual outputs. It supports integration with standard video formats and includes customizable parameters so users can tune stabilization aggressiveness.
    Downloads: 0 This Week
  • 10
    FreeImage is a library project for developers who would like to support popular graphics image formats (PNG, JPEG, TIFF, BMP and others). Some highlights: it is extremely simple to use, not limited to the local PC (unique FreeImageIO), and plugin driven!
    Downloads: 2,408 This Week
  • 11
    OpenPiano - Virtual Piano for Windows

    Desktop piano playable with a PC keyboard, mouse, or MIDI device.

    OpenPiano is a Windows desktop piano application that allows you to play, practice, and record music using your PC keyboard, mouse, or a MIDI device. It supports real-time playback using SoundFonts and provides on-screen piano layouts for visual feedback while playing. OpenPiano is designed to run entirely locally. It does not require accounts, cloud services, or an internet connection for core functionality. Project links: Website: https://www.justagwas.com/projects/openpiano GitHub: https://github.com/Justagwas/openpiano Documentation: https://github.com/Justagwas/openpiano/wiki The application is fully open source. ...
    Downloads: 88 This Week
  • 12
    Caesium - Image Compressor
    ...THIS PROJECT HAS BEEN MOVED!! https://github.com/Lymphatus/caesium-image-compressor Caesium reduces the size of your pictures by up to 90% while preserving the original visual quality. It lets you save a lot of space and quickly upload your pictures to the web. The software is user-friendly, with a simple and clear interface.
    Downloads: 83 This Week
  • 13
    HuMidi

    Play MIDI like a human in ROBLOX with automatic sustain pedals.

    ...The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance than any other MIDI player could! Personally tested in: Visual Pianos, Starving Pianists, Digital Piano, and Piano's Got Talent. This tool is universal, as long as the in-game piano can be played on a keyboard.
    Downloads: 203 This Week
  • 14
    AnimateDiff

    Plug-n-play module turning text-to-image models into animation

    ...This plug-and-play tool is compatible with a wide range of community models and facilitates the generation of animation directly from pre-existing text-to-image models. It supports various configurations to create animations with different visual styles, providing flexibility and ease of use for developers and artists interested in exploring dynamic, AI-generated animations.
    Downloads: 40 This Week
  • 15
    xSTUDIO

    xSTUDIO is a high performance playback and review tool.

    xSTUDIO is a high performance playback and review tool designed by and for Visual Effects, Animation and Post Production professionals. The application can load and play large collections of media files. The efficient playback engine allows you to quickly load and play high resolution image formats with a wide range of file formats and encoding. Intuitive tools allow you to create and organise playlists and media sub-sets within playlists to build interactive review sessions, image and video reference libraries. ...
    Downloads: 29 This Week
  • 16
    SonicDive-8D-Music-Player

    SonicDive 8D Music Player v-1.0

    SonicDive is an immersive audio visualization & effects-based music player designed to deliver a next-level listening experience. It combines dynamic spectrums with advanced spatial audio effects like 3D & 8D sound. ✨ Features: 🎵 Audio Visual Spectrums: SonicDive supports multiple real-time audio visualizations: 💿 Disk Spectrum, 📊 Bars Spectrum, 🌊 Wave Spectrum, 🖼️ Thumbnail Spectrum, ⭕ Circle Spectrum. Each spectrum reacts dynamically to the music’s frequency and intensity. 🎚️ Audio Effects & Modes: Choose from a variety of sound profiles to match your mood: 🔊 Flat, 🎧 3D Audio, 🎧 8D Audio, 🎤 Hip-Hop, 🎻 Classic, 🎸 Rock, 🎥 Dolby Effect.
    Downloads: 5 This Week
  • 17
    Bhakti-Apps
    This app is meant for a simplified user experience, perfect for note takers. It has a good encryption system, and the best feature is that it works on the CLI, so its compatibility is great.
    Downloads: 0 This Week
  • 18

    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
  • 19
    Random Colors

    Generate random colors

    Generate random colors: click on the frame to generate a new color, then copy the HEX code for use in a visual art or communication application.
    Downloads: 0 This Week
  • 20

    PMS for REGZA

    A DLNA-compliant UPnP Media Server

    PMS for REGZA is a DLNA-compliant Media Server. As a fork of the well-known "PS3 Media Server", it aims especially to improve functionality on TOSHIBA REGZA TVs while preserving compatibility with other renderers. Details: Home Page: http://www32.atwiki.jp/pms_regza
    Downloads: 28 This Week
  • 21
    Surface Defect Detection Dataset Papers

    Constantly summarizing open source dataset and critical papers

    At present, surface defect detection equipment based on machine vision has widely replaced manual visual inspection in various industrial fields, including 3C, automobiles, home appliances, machinery manufacturing, semiconductors and electronics, chemicals, pharmaceuticals, aerospace, and light industry. Traditional surface defect detection methods based on machine vision often use conventional image processing algorithms or hand-designed features plus classifiers. ...
    Downloads: 0 This Week
  • 22
    BasicSR

    Winning Solution in NTIRE19 Challenges on Video Restoration

    ...The architecture includes bespoke modules (e.g., Pyramid, Cascading and Deformable alignment and Temporal Spatial Attention fusion) that align information across frames and emphasize important features for restoration, enabling state-of-the-art performance on benchmarks such as the REDS challenge. By fusing spatial and temporal context, EDVR produces significantly improved visual quality in restored videos compared with approaches that treat each frame independently.
    Downloads: 0 This Week
  • 23
    Consistent Depth

    We estimate dense, flicker-free, geometrically consistent depth

    ...During inference, the model fine-tunes itself to align with the geometric constraints of a specific input video, ensuring stable and realistic depth maps even in less-constrained regions. This approach achieves improved geometric consistency and visual stability compared to prior monocular reconstruction methods. The project can process challenging hand-held video footage, including footage with moderate dynamic motion, making it practical for real-world usage.
    Downloads: 0 This Week
  • 24
    GIF for CLI

    Takes in a GIF, short video, or a query to the Tenor GIF API

    ...Depending on terminal capabilities, it can use ANSI color blocks or image protocols to achieve surprisingly faithful playback. The tool includes conveniences such as looping control, scaling to fit your terminal, and caching to avoid repeated downloads. It’s useful for quick visual feedback, lightweight demos, or just adding some personality to scripts and readmes. Despite its whimsical nature, the project is a tight example of frame decoding, timing, and terminal rendering glued together in a single command.
    Downloads: 0 This Week
  • 25
    Imogen

    GPU Texture Generator

    ...It allows users to build complex material textures using a graph-based interface, combining operations like blending, noise, filters, and color correction in a non-destructive workflow. Built with Vulkan and ImGui, Imogen provides immediate visual feedback and supports GPU acceleration for high-resolution texture output. It's particularly useful in game development, VFX, and digital art where procedural workflows are valued for their flexibility and speed.
    Downloads: 1 This Week