Search Results for "image processing toolbox for..." - Page 5

Showing 851 open source projects for "image processing toolbox for..."

View related business solutions
  • Streamline Azure Security with Palo Alto Networks VM-Series Icon
    Streamline Azure Security with Palo Alto Networks VM-Series

    Centrally manage physical and virtualized firewalls with Panorama

    Improve your security posture and reduce incident response time. Use the VM-Series to natively analyze Azure traffic and dynamically drive policy updates based on workload changes.
    Learn more
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 1
    AutoGluon

    AutoGluon

    AutoGluon: AutoML for Image, Text, and Tabular Data

    AutoGluon enables easy-to-use and easy-to-extend AutoML with a focus on automated stack ensembling, deep learning, and real-world applications spanning image, text, and tabular data. Intended for both ML beginners and experts, AutoGluon enables you to quickly prototype deep learning and classical ML solutions for your raw data with a few lines of code. Automatically utilize state-of-the-art techniques (where appropriate) without expert knowledge. Leverage automatic hyperparameter tuning, model selection/ensembling, architecture search, and data processing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MDCx

    MDCx

    Movie metadata scraper and organizer for media libraries and NFO

    ...MDCx can download information such as titles, cast data, artwork, and other metadata, then generate standardized NFO files compatible with media management systems. It also supports image processing tasks such as downloading and cropping artwork used by media centers. It includes several interfaces, allowing users to operate it through a graphical desktop application, a browser-based web interface, or command-line utilities depending on their workflow. Its architecture separates core scraping logic from the user interfaces, allowing the same metadata processing system to be reused across different modes.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Weaviate

    Weaviate

    Weaviate is a cloud-native, modular, real-time vector search engine

    Weaviate in a nutshell: Weaviate is a vector search engine and vector database. Weaviate uses machine learning to vectorize and store data, and to find answers to natural language queries. With Weaviate you can also bring your custom ML models to production scale. Weaviate in detail: Weaviate is a low-latency vector search engine with out-of-the-box support for different media types (text, images, etc.). It offers Semantic Search, Question-Answer-Extraction, Classification, Customizable...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    LearnOpenCV

    LearnOpenCV

    C++ and Python Examples

    LearnOpenCV is a large educational repository that provides practical computer vision and deep learning examples in both Python and C++. The project accompanies the LearnOpenCV blog and contains hundreds of hands-on tutorials covering topics such as object detection, image processing, pose estimation, and neural networks. It is structured as a learning resource where each directory corresponds to a specific article or technical walkthrough. The repository supports beginners and advanced practitioners by offering reproducible code that demonstrates real-world computer vision techniques. Many examples integrate popular frameworks like PyTorch, OpenCV, and ONNX to reflect modern AI workflows. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 5
    HivisionIDPhoto

    HivisionIDPhoto

    HivisionIDPhotos: a lightweight and efficient AI ID photos tools

    ...It also allows the generation of layout sheets such as six-inch photo arrangements for printing multiple ID photos on a single page. The project focuses on building a practical pipeline for automated ID photo production using AI-based segmentation and image processing techniques.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    loonflow

    loonflow

    A workflow engine base on django python

    a workflow engine base on django The django-based workflow engine system (called through the http interface, can be used as a unified workflow engine within the enterprise, providing all workflows such as permission application, resource application, release application, leave, reimbursement, it service, etc. Scenario services), if there is a certain development capability, it is recommended to use only the back-end engine function, and the front-end customized development according to the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Spring AI Alibaba Examples

    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    ...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    GeoAI

    GeoAI

    GeoAI: Artificial Intelligence for Geospatial Data

    GeoAI is a comprehensive open-source Python package designed to integrate artificial intelligence techniques with geospatial data analysis, enabling users to perform advanced geographic modeling and visualization tasks with ease. It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    TensorRT Node for ComfyUI

    TensorRT Node for ComfyUI

    Enables the best performance on NVIDIA RTX Graphics Cards

    ComfyUI_TensorRT is an extension that lets ComfyUI run AI inference through NVIDIA’s TensorRT, aiming to get faster, more efficient execution on supported GPUs. It bridges the gap between ComfyUI’s flexible, node-based workflows and TensorRT’s highly optimized engine format. The result is that complex diffusion or image-processing graphs can be accelerated without the user having to rewrite the pipeline. The repo typically includes instructions for converting models to TensorRT engines and for wiring those engines into ComfyUI nodes. This is particularly attractive for power users who run many generations or who host ComfyUI on dedicated hardware and want to squeeze out every bit of GPU performance. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 10
    Replicate Flux MCP

    Replicate Flux MCP

    MCP for Replicate Flux Model

    The Replicate Flux MCP is an advanced Model Context Protocol server that empowers AI assistants to generate high-quality images and vector graphics. It leverages Black Forest Labs' Flux Schnell model for raster images and Recraft's V3 SVG model for vector graphics via the Replicate API. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Readest

    Readest

    Readest is a modern, feature-rich ebook reader

    Readest is a project meant to facilitate reading, studying, or consuming content by integrating reading tools with AI-powered assistance. Although the repository is not as widely documented or popular as some, the idea is that Readest supports features to help with reading comprehension — likely combining OCR / text retrieval, translation, note-taking, or summarization for reading materials (eBooks, articles, PDFs). The goal appears to be to let users feed in arbitrary reading material and...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 12

    GraphicsMagick

    Swiss army knife of image processing

    GraphicsMagick provides a set of commandline tools and programming APIs for manipulating, editing, and converting raster and vector images. It is derived from ImageMagick, with the objective of providing better stability and performance than ImageMagick while retaining the original MIT/X11 license.
    Leader badge
    Downloads: 3,319 This Week
    Last Update:
    See Project
  • 13
    Advanced AI explainability for PyTorch

    Advanced AI explainability for PyTorch

    Advanced AI Explainability for computer vision

    pytorch-grad-cam is an open-source library that provides advanced explainable AI techniques for interpreting the predictions of deep learning models used in computer vision. The project implements Grad-CAM and several related visualization methods that highlight the regions of an image that most strongly influence a neural network’s decision. These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Step3-VL-10B

    Step3-VL-10B

    Multimodal model achieving SOTA performance

    ...It achieves this efficiency and strong performance through unified pre-training on a massive 1.2 trillion-token multimodal corpus that jointly optimizes a language-aligned perception encoder with a powerful decoder, creating deep synergy between image processing and text understanding.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DreamCraft3D

    DreamCraft3D

    Official implementation of DreamCraft3D

    DreamCraft3D is DeepSeek’s generative 3D modeling framework / model family that likely extends their earlier 3D efforts (e.g. Shap-E or Point-E style models) with more capability, control, or expression. The name suggests a “dream crafting” metaphor—users probably supply textual or image prompts and generate 3D assets (point clouds, meshes, scenes). The repository includes model code, inference scripts, sample prompts, and possibly dataset preparation pipelines. It may integrate rendering or post-processing modules (e.g. mesh smoothing, texturing) to make the outputs more output-ready. Because 3D generation is hardware‐intensive, the repository likely also includes optimizations like quantization, pruning, or inference accelerations (e.g. using FlashMLA or DeepEP) to make the generation pipeline faster or more efficient. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OpenAI

    OpenAI

    Swift community driven package for OpenAI public API

    ...It simplifies the integration of AI capabilities into iOS, macOS, and other Swift-based applications by offering a clean abstraction over the underlying REST API, enabling developers to focus on functionality rather than low-level implementation details. The SDK supports a wide range of features including chat completions, embeddings, image generation, audio processing, and structured outputs, making it a comprehensive toolkit for building AI-powered applications. It also includes support for advanced features such as function calling, assistants, and tool integration through protocols like Model Context Protocol, enabling more complex and interactive AI workflows.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    Caesium - Image Compressor
    !! THIS PROJECT HAS BEEN MOVED!! https://github.com/Lymphatus/caesium-image-compressor Caesium reduces the size of your picture up to 90%, preserving the original visual quality. Allows you to save a lot of space and easily upload your pictures on the web in a moment. The software is user-friendly with a simple and clear interface.
    Downloads: 62 This Week
    Last Update:
    See Project
  • 18
    HunyuanVideo

    HunyuanVideo

    HunyuanVideo: A Systematic Framework For Large Video Generation Model

    HunyuanVideo is a cutting-edge framework designed for large-scale video generation, leveraging advanced AI techniques to synthesize videos from various inputs. It is implemented in PyTorch, providing pre-trained model weights and inference code for efficient deployment. The framework aims to push the boundaries of video generation quality, incorporating multiple innovative approaches to improve the realism and coherence of the generated content. Release of FP8 model weights to reduce GPU...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    Gyroflow

    Gyroflow

    Video stabilization using gyroscope data

    Gyroflow is an advanced open-source video stabilization application that uses gyroscope and motion sensor data to produce highly accurate and cinematic stabilization results. Instead of relying solely on visual estimation like traditional software stabilizers, it processes real motion data recorded by cameras or external sensors to achieve more precise compensation. This approach allows it to correct complex camera movement, rolling shutter distortion, and lens artifacts while preserving...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 20
    GLM-4.5V

    GLM-4.5V

    GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

    GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    Segmentation Models

    Segmentation Models

    Segmentation models with pretrained backbones. PyTorch

    Segmentation models with pre trained backbones. High-level API (just two lines to create a neural network) 9 models architectures for binary and multi class segmentation (including legendary Unet) 124 available encoders (and 500+ encoders from timm) All encoders have pre-trained weights for faster and better convergence. Popular metrics and losses for training routines. All encoders have pretrained weights. Preparing your data the same way as during weights pre-training may give you better...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. Developers can compose custom pipelines...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 23
    mp-html

    mp-html

    Small program rich text component, supports rendering and editing html

    A powerful applet-rich text component. Small program rich text component supports rendering and editing HTML and supports use on WeChat, QQ, Baidu, Alipay, Toutiao, and uni-app platforms. Displaying dynamic HTML rich text is a necessary requirement for many applications. The applet platform does not support dom operations, making this a problem. The built-in rich-text component supports few tags and blocks all events, making it difficult for practical application. Therefore, there is such a...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 24
    Waifu2x-Extension-GUI

    Waifu2x-Extension-GUI

    Photo/Video/GIF enlargement using machine learning

    Image & GIF & Video Super-Resolution using Deep Convolutional Neural Networks. Built-in image processing algorithm: Waifu2x / SRMD / RealSR / Anime4K / ACNet Built-in image processing engine: Waifu2x-caffe / Waifu2x-converter / Waifu2x-ncnn-vulkan / SRMD-ncnn-vulkan / RealSR-ncnn-vulkan / Anime4KCPP Github: https://github.com/AaronFeng753/Waifu2x-Extension-GUI
    Leader badge
    Downloads: 563 This Week
    Last Update:
    See Project
  • 25
    MiniMax Skills

    MiniMax Skills

    Development skills for AI coding agents

    MiniMax skills is a modular system designed to provide structured development capabilities for AI coding agents, enabling them to perform complex engineering tasks with guided workflows and domain-specific knowledge. It defines a set of reusable “skills” that encapsulate best practices, architectural patterns, and step-by-step processes for building applications across multiple platforms. These skills can be integrated into AI tools to improve the quality and consistency of generated code,...
    Downloads: 2 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB