Search Results for "image processing toolbox for..." - Page 3

Showing 636 open source projects for "image processing toolbox for..."

  • 1
    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. Its use cases revolve around streamlining and optimizing the data processing workflow for LLMs. Its modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and efficient at transforming unstructured data into...
    Downloads: 3 This Week
  • 2
    Diffusers

    State-of-the-art diffusion models for image and audio generation

    Diffusers is the go-to library for state-of-the-art pretrained diffusion models for generating images, audio, and even 3D structures of molecules. Whether you're looking for a simple inference solution or training your own diffusion models, Diffusers is a modular toolbox that supports both. Our library is designed with a focus on usability over performance, simple over easy, and customizability over abstractions. State-of-the-art diffusion pipelines that can be run in inference with just a...
    Downloads: 3 This Week
  • 3
    ComfyUI SUPIR

    SUPIR upscaling wrapper for ComfyUI

    The ComfyUI-SUPIR project is a ComfyUI integration of the SUPIR model, which is designed for high-quality image restoration and super-resolution. It enables users to enhance low-resolution or degraded images using advanced diffusion-based techniques. The integration provides nodes that allow users to control parameters such as noise levels, guidance strength, and output quality. It is particularly useful for workflows that require upscaling or restoring images before further processing. ...
    Downloads: 7 This Week
  • 4
    reverse-SynthID

    Reverse engineering Gemini's SynthID detection

    Reverse-SynthID is a research-focused project that analyzes and reverse-engineers Google’s SynthID watermarking system used in AI-generated images. It leverages signal processing and spectral analysis techniques to identify hidden watermark patterns without access to proprietary encoding methods. The project introduces a multi-resolution “SpectralCodebook” that maps watermark characteristics across different image sizes. Using this approach, it can detect SynthID watermarks with high accuracy and selectively reduce or remove them through frequency-domain manipulation. ...
    Downloads: 6 This Week
  • 5
    SageMaker Spark Container

    Docker image used to run data processing workloads

    Apache Spark™ is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.
    Downloads: 0 This Week
  • 6
    clip-retrieval

    Easily compute clip embeddings and build a clip retrieval system

    ...It allows developers to compute embeddings for both images and text efficiently and then index them for fast similarity search across massive datasets. The system is optimized for performance and scalability, capable of processing tens or even hundreds of millions of embeddings using GPU acceleration. It includes components for inference, indexing, filtering, and serving results through APIs, making it a complete pipeline for building production-ready retrieval systems. The framework also supports querying by image, text, or embedding, enabling flexible use cases such as reverse image search or multimodal content discovery. ...
    Downloads: 0 This Week
  • 7
    Depth Anything 3

    Recovering the Visual Space from Any Views

    Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.
    Downloads: 3 This Week
  • 8
    OpenAI Go

    The official Go library for the OpenAI API

    ...It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also offers examples to help developers quickly set up projects and test different API calls. Designed for reliability and ease of use, it is maintained to stay aligned with the evolving OpenAI API specifications.
    Downloads: 1 This Week
  • 9
    DeepSeek-OCR

    Contexts Optical Compression

    ...It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.
    Downloads: 8 This Week
  • 10
    ComfyUI Examples

    Examples of ComfyUI workflows

    ComfyUI_examples is the companion repository for ComfyUI that collects ready-made example workflows, nodes, and compositions to help users learn the node-based interface for AI image generation. Instead of starting from an empty graph, you can open an example and see how prompts, samplers, models, and image processing steps are wired together. This makes ComfyUI more approachable for people coming from “one text box” generators, because they can reverse-engineer complex pipelines visually. The examples also serve as references for best practices like model loading order, latent handling, upscaling chains, and conditioning. ...
    Downloads: 2 This Week
  • 11
    Keras Hub

    Pretrained model hub for Keras 3

    Keras Hub is a repository of pre-trained models for Keras 3, offering a collection of ready-to-use models for various machine-learning tasks. KerasHub is an extension of the core Keras API; KerasHub components are provided as Layer and Model implementations. If you are familiar with Keras, congratulations. You already understand most of KerasHub.
    Downloads: 0 This Week
  • 12
    export_fig

    A MATLAB toolbox for exporting publication quality figures

    export_fig is a MATLAB toolbox designed to simplify high-quality figure export from MATLAB, faithfully preserving visual appearance as seen on-screen (including size, colours, fonts, and layout) while offering precise control over output formats such as PNG, PDF, EPS, and more. Exporting a figure from MATLAB the way you want it (hopefully the way it looks on screen) can be a real headache for the uninitiated, thanks to all the settings that are required, and also due to some eccentricities...
    Downloads: 0 This Week
  • 13
    Delphi-OpenCV

    Project Delphi-OpenCV. Translation of OpenCV library header files

    Delphi-OpenCV is a set of Object Pascal bindings and tools that bring OpenCV functionality into the Delphi development environment. It provides access to computer vision capabilities such as image processing, feature detection, and video analysis within native Delphi applications. The project includes sample programs, preconfigured packages, and integration with related libraries like SDL and OpenGL. It supports multiple OpenCV versions and includes resources such as Haar cascades for face detection tasks. Developers can use it to build desktop applications that leverage real-time vision processing. ...
    Downloads: 4 This Week
  • 14
    RestorePhotos.io

    Restoring old and blurry face photos with AI

    RestorePhotos.io is an AI web app for restoring old, blurry, or low-quality face photos and bringing them back to life. It wraps the GFPGAN model (served via Replicate) behind a friendly Next.js front end, so non-technical users can upload an image and receive an enhanced version without ever touching ML code. The workflow is straightforward: you upload a photo, the serverless API route sends it to Replicate, and the restored image is returned and displayed in the UI. The project is production-oriented, not just a toy: it uses Bytescale for storage and image processing, Vercel for hosting and serverless functions, Auth.js + Neon for authentication and database, and Upstash Redis for rate limiting. ...
    Downloads: 4 This Week
  • 15
    fastdup

    An unsupervised and free tool for image and video dataset analysis

    fastdup is a free tool designed to rapidly extract valuable insights from your image and video datasets. It helps you improve the quality of your dataset's images and labels and reduce your data operations costs at scale.
    Downloads: 1 This Week
  • 16
    TIGRE

    TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox

    TIGRE is an open-source toolbox for fast and accurate 3D tomographic reconstruction for any geometry. Its focus is on iterative algorithms for improved image quality that have all been optimized to run on GPUs (including multi-GPUs) for improved speed. It combines the higher-level abstraction of MATLAB or Python with the performance of CUDA at a lower level in order to make it both fast and easy to use.
    Downloads: 0 This Week
  • 17
    Scrimage

    JVM - Java, Kotlin, Scala image processing library

    Scrimage is an immutable, functional, and performant JVM library for the manipulation of images. The aim of this library is to provide a simple and concise way to do common image operations, such as resizing to fit a required width and height, converting between formats, applying filters, and so on. It is easy to use from any language on the JVM. A typical use case for this library would be creating thumbnails of images uploaded by users in a web app, bounding a set of product images so that...
    Downloads: 0 This Week
  • 18
    PaddleNLP

    Easy-to-use and powerful NLP library with Awesome model zoo

    PaddleNLP is a natural language processing development library for PaddlePaddle with three major features: an easy-to-use text-domain API, application examples for multiple scenarios, and high-performance distributed training. It aims to improve the modeling and development efficiency of PaddlePaddle developers working with text, and it provides rich examples of NLP applications. It provides rich industry-level preset-task capabilities...
    Downloads: 0 This Week
  • 19
    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding, cropping, resizing, and many other augmentations. ...
    Downloads: 1 This Week
  • 20
    ITK-SNAP is a tool for segmenting anatomical structures in medical images. It provides an automatic active contour segmentation pipeline, along with a supporting manual segmentation toolbox. ITK-SNAP has a full-featured UI aimed at clinical researchers.
    Downloads: 2,222 This Week
  • 21
    Google Highway

    Performance-portable, length-agnostic SIMD with runtime dispatch

    ...This portability is achieved through dynamic or static dispatch mechanisms that select the best available instruction set at runtime or compile time. The library is designed for developers who need to maximize CPU performance in domains such as image processing, compression, cryptography, and scientific computing.
    Downloads: 0 This Week
  • 22
    OpenAI Quickstart Node

    Node.js example app from the OpenAI API quickstart tutorial

    ...The repository provides structured sample code for a variety of API endpoints, including chat completions, assistants, embeddings, fine-tuning, moderation, batch processing, and image generation. Each folder contains runnable scripts that demonstrate both basic usage and more advanced scenarios. By following the examples, developers can quickly understand how to authenticate with an API key, send requests, and handle responses within a Node.js environment. The project is a practical starting point for building AI-powered applications, serving as a foundation for experimentation and integration into larger projects. ...
    Downloads: 0 This Week
  • 23
    BookStack

    Simple & Free Wiki Software

    BookStack is a free and open source platform for storing and organising information and documentation. A self-hosted and opinionated wiki system, BookStack is simple and easy to use, giving even new users with just basic word-processing skills a pleasant out of the box experience. BookStack offers a relaxed, open and positive approach. While the platform can provide advanced power features to those who want them, it is primarily designed not to be extensible outside of its core purpose. That being said, BookStack already comes with plenty of powerful features, such as search and linking, cross-book sorting, image management and more. ...
    Downloads: 4 This Week
  • 24
    LocalAI

    The free, Open Source alternative to OpenAI, Claude and others

    ...It acts as a drop-in replacement for APIs such as OpenAI, enabling developers to build AI-powered applications without relying on external cloud services. The platform supports a wide range of model types, including text generation, image creation, speech processing, and embeddings. LocalAI can run on consumer-grade hardware and does not necessarily require a GPU, making it accessible for local development and private deployments. It integrates with multiple backends like llama.cpp, transformers, and diffusers to support different AI workloads. With its self-hosted architecture and OpenAI-compatible API, LocalAI enables developers to build secure, local-first AI applications.
    Downloads: 21 This Week
  • 25
    video-compare

    Split screen video comparison tool using FFmpeg and SDL2

    ...It uses FFmpeg libraries and SDL2 to decode and render synchronized playback, allowing precise frame-by-frame comparison. The tool supports videos with different formats, resolutions, and frame rates, making it versatile for testing encoding changes or processing effects. It includes interactive navigation controls and customizable display options to enhance analysis workflows. Users can visually compare codecs, scaling algorithms, and other transformations to evaluate quality differences. The application can also handle image sequences, extending its use beyond standard video files. Overall, it provides a powerful environment for visual quality assessment and debugging.
    Downloads: 7 This Week