Showing 38 open source projects for "preprocessing image"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Watermark-Removal

    Watermark-Removal

    Machine learning image inpainting task that removes watermarks

    ...Through these techniques, the model learns to identify regions of the image affected by the watermark and generate realistic replacements for the missing visual information. The repository contains code for preprocessing images, training the model, and running inference on images to automatically remove watermark artifacts.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    CLIP

    CLIP

    CLIP, Predict the most relevant text snippet given an image

    ...Once trained, you can give it any text labels and ask it to pick which label best matches a given image—even without explicit training for that classification task. The repository provides code for model architecture, preprocessing transforms, evaluation pipelines, and example inference scripts. Because it generalizes to arbitrary labels via text prompts, CLIP is a powerful tool for tasks that involve interpreting images in terms of descriptive language.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Computer Vision in Action

    Computer Vision in Action

    A computer vision closed-loop learning platform

    ...It serves as a hands-on companion for learners and engineers who want to understand not just the theory, but how computer vision is actually implemented for tasks like object detection, image classification, feature tracking, optical flow, and image segmentation. The repository includes structured code examples, scripts, and notebooks that cover pipeline construction, preprocessing, model inference, and visual output rendering, making it easy for newcomers or intermediate practitioners to adapt patterns to their own projects. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    FramePack

    FramePack

    Lets make video diffusion practical

    ...The repository demonstrates both packing and unpacking steps, making it straightforward to integrate into preprocessing pipelines. It’s useful for diffusion and generative models that learn from sequential image datasets, as well as classical pipelines that batch many related frames. With a simple API and examples, it invites experimentation on tradeoffs between compression, fidelity, and speed.
    Downloads: 14 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    DeepSeek-OCR

    DeepSeek-OCR

    Contexts Optical Compression

    ...It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 6
    GeoAI

    GeoAI

    GeoAI: Artificial Intelligence for Geospatial Data

    ...It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide range of tasks including image classification, object detection, segmentation, and change detection, making it suitable for applications in environmental monitoring, urban planning, and disaster response. GeoAI simplifies complex workflows by offering high-level APIs that abstract data preprocessing, model training, and inference, reducing the technical barrier for users who are not experts in both AI and geospatial systems.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    ComfyUI SUPIR

    ComfyUI SUPIR

    SUPIR upscaling wrapper for ComfyUI

    The ComfyUI-SUPIR project is a ComfyUI integration of the SUPIR model, which is designed for high-quality image restoration and super-resolution. It enables users to enhance low-resolution or degraded images using advanced diffusion-based techniques. The integration provides nodes that allow users to control parameters such as noise levels, guidance strength, and output quality. It is particularly useful for workflows that require upscaling or restoring images before further processing. The...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    TRELLIS 2

    TRELLIS 2

    Native and Compact Structured Latents for 3D Generation

    ...TRELLIS.2 emphasizes speed and compact latent representation, allowing bidirectional conversion between mesh formats and internal representations with minimal preprocessing and optimized performance on high-end GPUs.
    Downloads: 38 This Week
    Last Update:
    See Project
  • 9
    Unstructured.IO

    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. The use cases of unstructured revolve around streamlining and optimizing the data processing workflow for LLMs. unstructured modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and is efficient in transforming unstructured data into...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Anime4KCPP

    Anime4KCPP

    A high performance anime upscaler

    Anime4KCPP provides an optimized bloc97's Anime4K algorithm version 0.9, and it also provides its own CNN algorithm ACNet, it provides a variety of way to use, including preprocessing and real-time playback, it aims to be a high-performance tool to process both image and video. This project is for learning and the exploration task of the algorithm course in SWJTU. Anime4K is a simple high-quality anime upscale algorithm. Version 0.9 does not use any machine learning approaches and can be very fast in real-time processing or pretreatment. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 11
    DALI

    DALI

    A GPU-accelerated library containing highly optimized building blocks

    ...DALI addresses the problem of the CPU bottleneck by offloading data preprocessing to the GPU. Additionally, DALI relies on its own execution engine, built to maximize the throughput of the input pipeline.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    SteadyDancer

    SteadyDancer

    Harmonized and Coherent Human Image Animation

    SteadyDancer is a research-oriented motion stabilization and dancer tracking system designed to analyze and correct motion in videos, making captured performances appear smoother and more stable while preserving expressiveness. It employs computer vision and motion modeling to estimate and reduce unwanted jitters, shakes, or camera wobbles — particularly in dance or movement sequences where traditional smoothing would distort intentional motion. By differentiating between intentional...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Kubeflow pipelines

    Kubeflow pipelines

    Machine Learning Pipelines for Kubeflow

    ...The pipeline includes the definition of the inputs (parameters) required to run the pipeline and the inputs and outputs of each component. A pipeline component is a self-contained set of user code, packaged as a Docker image, that performs one step in the pipeline. For example, a component can be responsible for data preprocessing, data transformation, model training, and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    unwarp

    Increase image resolution by eliminating atmospheric distortion

    Unwarp is an open-source tool that enhances image resolution by eliminating scintillations caused by atmospheric turbulence and similar distortion phenomena. The software processes a series of images of the same subject, aligning and stacking them using advanced feature selection algorithms and phase correlation approaches. The core technique matches features between images, applies triangulation across the entire frame, and warps each pixel to its optimal position. The resulting aligned...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    codenection

    We achieve connection by code, thus code-nection.

    ...The main goals are to enable code reuse, reduce the learning curve, minimize mistakes, and improve the efficiency within our team. We will leverage some highly useful toolkits, such as fmriprep, xcpd, qsiprep, etc., which are high-quality preprocessing and post-processing toolkits. We will also continue to expand our toolkit collection. Our aim is to establish a unified standard for image processing. Additionally, we will also implement some algorithms for data processing in the future. Hope that we can learn from each other, help one another, and achieve better versions of ourselves together.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    TomoJ

    TomoJ

    ImageJ plugin to perform Electron Tomography

    ...Sorzano et al. https://doi.org/10.1016/j.yjsbx.2020.100037. BMC Bioinformatics. 2009 Apr 27;10:124."Marker-free image registration of electron tomography tilt-series." C.O.S. Sorzano et al. reconstruction part was described in: BMC Bioinformatics. 2007 Aug 6;8:288. "TomoJ: tomography software for three-dimensional reconstruction in transmission electron microscopy."Messaoudi C et al
    Downloads: 19 This Week
    Last Update:
    See Project
  • 17
    ToxTrac

    ToxTrac

    Free Animal Tracking Software

    ToxTrac is a free Windows program optimized for tracking animals. It uses an advanced tracking algorithm and includes Preprocessing, Background subtraction, Thresholding, Post-processing, and Filters. It is robust; very fast; and can handle one or several animals in one or several environments. The program provides useful statistics as output. ToxTrac can be used for fish, insects, rodents, etc. If used, please cite: Rodriguez, A., Zhang, H., Klaminder, J., Brodin, T., Andersson, P....
    Leader badge
    Downloads: 161 This Week
    Last Update:
    See Project
  • 18

    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Glint Translator
    Glint Translator is a high-performance Windows application for real-time in-game and voice translation without interrupting gameplay. It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 20
    Pigo

    Pigo

    Fast face detection, pupil/eyes localization

    Fast face detection, pupil/eyes localization and facial landmark points detection library in pure Go. Pigo is a pure Go face detection, pupil/eyes localization and facial landmark points detection library based on the Pixel Intensity Comparison-based Object detection paper. The reason why Pigo has been developed is because almost all of the currently existing solutions for face detection in the Go ecosystem are purely bindings to some C/C++ libraries like OpenCV or dlib, but calling a C...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    CCTV Frame Timestamp Extractor

    CCTV Footage Timestamp Search Tool

    ...Link to paper: https://link.springer.com/chapter/10.1007/978-3-031-10078-9_8 The project has been divided into four modules: Framextract.py- Extracts frames from video footages Reconstruct.py- Attempts to repair unplayable video by extracting the frames. framestitch.py- Attempts to construct video using frames extracted from unplayable video. OCR.py- Performs image preprocessing & OCR on the extracted frames.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SwiftOCR

    SwiftOCR

    Fast and simple OCR library written in Swift

    SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line long alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it exceeds. Tesseract is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    VGGFace2

    VGGFace2

    VGGFace2 Dataset for Face Recognition

    ...These models achieve strong verification performance on benchmarks such as IJB-B and include variants with lower-dimensional embeddings for compact feature representation. The project also includes preprocessing tools, face detection scripts, and etc.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 24
    Image Quality Assessment

    Image Quality Assessment

    Convolutional Neural Networks to predict aesthetic quality of images

    ...The goal of the project is to automatically evaluate images based on perceived quality factors such as composition, clarity, and visual appeal. Instead of relying on simple image statistics, the system learns patterns that correlate with human judgments about image aesthetics and technical quality. The repository includes code for training models, performing inference, and evaluating predicted scores against labeled datasets. It also provides utilities for image preprocessing and data management that help prepare datasets for training deep learning models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Deep Learning for Medical Applications

    Deep Learning for Medical Applications

    Deep Learning Papers on Medical Image Analysis

    Deep-Learning-for-Medical-Applications is a repository that compiles deep learning methods, code implementations, and examples applied to medical imaging and healthcare data. The project addresses domain-specific challenges like segmentation, classification, detection, and multimodal data (e.g. MRI, CT, X-ray) using state-of-the-art architectures (e.g. U-Net, ResNet, GAN variants) tailored to medical constraints (small datasets, annotation costs, class imbalance). It includes Jupyter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB