Showing 41 open source projects for "preprocessing image"

  • 1
    Watermark-Removal

    Machine learning image inpainting task that removes watermarks

    ...Through these techniques, the model learns to identify regions of the image affected by the watermark and generate realistic replacements for the missing visual information. The repository contains code for preprocessing images, training the model, and running inference on images to automatically remove watermark artifacts.
    Downloads: 4 This Week
  • 2
    FramePack

    Let's make video diffusion practical

    ...The repository demonstrates both packing and unpacking steps, making it straightforward to integrate into preprocessing pipelines. It’s useful for diffusion and generative models that learn from sequential image datasets, as well as classical pipelines that batch many related frames. With a simple API and examples, it invites experimentation on tradeoffs between compression, fidelity, and speed.
    Downloads: 37 This Week
  • 3
    CLIP

    CLIP, Predict the most relevant text snippet given an image

    ...Once trained, you can give it any text labels and ask it to pick which label best matches a given image—even without explicit training for that classification task. The repository provides code for model architecture, preprocessing transforms, evaluation pipelines, and example inference scripts. Because it generalizes to arbitrary labels via text prompts, CLIP is a powerful tool for tasks that involve interpreting images in terms of descriptive language.
    Downloads: 0 This Week
  • 4
    DeepSeek-OCR

    Contexts Optical Compression

    ...It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.
    Downloads: 16 This Week
  • 5
    GeoAI

    GeoAI: Artificial Intelligence for Geospatial Data

    ...It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide range of tasks including image classification, object detection, segmentation, and change detection, making it suitable for applications in environmental monitoring, urban planning, and disaster response. GeoAI simplifies complex workflows by offering high-level APIs that abstract data preprocessing, model training, and inference, reducing the technical barrier for users who are not experts in both AI and geospatial systems.
    Downloads: 6 This Week
  • 6
    ComfyUI SUPIR

    SUPIR upscaling wrapper for ComfyUI

    The ComfyUI-SUPIR project is a ComfyUI integration of the SUPIR model, which is designed for high-quality image restoration and super-resolution. It enables users to enhance low-resolution or degraded images using advanced diffusion-based techniques. The integration provides nodes that allow users to control parameters such as noise levels, guidance strength, and output quality. It is particularly useful for workflows that require upscaling or restoring images before further processing. The...
    Downloads: 4 This Week
  • 7
    Computer Vision in Action

    A computer vision closed-loop learning platform

    ...It serves as a hands-on companion for learners and engineers who want to understand not just the theory, but how computer vision is actually implemented for tasks like object detection, image classification, feature tracking, optical flow, and image segmentation. The repository includes structured code examples, scripts, and notebooks that cover pipeline construction, preprocessing, model inference, and visual output rendering, making it easy for newcomers or intermediate practitioners to adapt patterns to their own projects. ...
    Downloads: 0 This Week
  • 8
    TRELLIS.2

    Native and Compact Structured Latents for 3D Generation

    ...TRELLIS.2 emphasizes speed and compact latent representation, allowing bidirectional conversion between mesh formats and internal representations with minimal preprocessing and optimized performance on high-end GPUs.
    Downloads: 37 This Week
  • 9
    3D Gaussian Splatting

    Original reference implementation of "3D Gaussian Splatting"

    Gaussian Splatting is the official implementation of “3D Gaussian Splatting for Real-Time Radiance Field Rendering,” a research project for reconstructing and rendering 3D scenes from collections of images. The system represents scenes as millions of optimized 3D Gaussians rather than traditional meshes or neural fields, allowing high-quality novel view synthesis with real-time rendering performance. It includes training scripts, rendering tools, scene conversion utilities, and viewers for...
    Downloads: 1 This Week
  • 10
    Anime4KCPP

    A high performance anime upscaler

    Anime4KCPP provides an optimized implementation of bloc97's Anime4K algorithm version 0.9, as well as its own CNN algorithm, ACNet. It offers a variety of ways to use it, including preprocessing and real-time playback, and aims to be a high-performance tool for processing both images and video. This project is for learning and for the exploration task of the algorithm course at SWJTU. Anime4K is a simple, high-quality anime upscaling algorithm. Version 0.9 does not use any machine learning approaches and can be very fast in real-time processing or preprocessing. ...
    Downloads: 18 This Week
  • 11
    Unstructured.IO

    Open source libraries and APIs to build custom preprocessing pipelines

    The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. Its use cases revolve around streamlining and optimizing the data processing workflow for LLMs. The library's modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and efficient at transforming unstructured data into...
    Downloads: 0 This Week
  • 12
    SteadyDancer

    Harmonized and Coherent Human Image Animation

    SteadyDancer is a research-oriented motion stabilization and dancer tracking system designed to analyze and correct motion in videos, making captured performances appear smoother and more stable while preserving expressiveness. It employs computer vision and motion modeling to estimate and reduce unwanted jitters, shakes, or camera wobbles — particularly in dance or movement sequences where traditional smoothing would distort intentional motion. By differentiating between intentional...
    Downloads: 1 This Week
  • 13
    Kubeflow pipelines

    Machine Learning Pipelines for Kubeflow

    ...The pipeline includes the definition of the inputs (parameters) required to run the pipeline and the inputs and outputs of each component. A pipeline component is a self-contained set of user code, packaged as a Docker image, that performs one step in the pipeline. For example, a component can be responsible for data preprocessing, data transformation, model training, and so on.
    Downloads: 0 This Week
  • 14
    DALI

    A GPU-accelerated library containing highly optimized building blocks

    ...DALI addresses the problem of the CPU bottleneck by offloading data preprocessing to the GPU. Additionally, DALI relies on its own execution engine, built to maximize the throughput of the input pipeline.
    Downloads: 0 This Week
  • 15
    unwarp

    Increase image resolution by eliminating atmospheric distortion

    Unwarp is an open-source tool that enhances image resolution by eliminating scintillations caused by atmospheric turbulence and similar distortion phenomena. The software processes a series of images of the same subject, aligning and stacking them using advanced feature selection algorithms and phase correlation approaches. The core technique matches features between images, applies triangulation across the entire frame, and warps each pixel to its optimal position. The resulting aligned...
    Downloads: 0 This Week
  • 16
    codenection

    We achieve connection by code, thus code-nection.

    ...The main goals are to enable code reuse, reduce the learning curve, minimize mistakes, and improve efficiency within our team. We will leverage some highly useful toolkits, such as fmriprep, xcpd, and qsiprep, which are high-quality preprocessing and post-processing toolkits. We will also continue to expand our toolkit collection. Our aim is to establish a unified standard for image processing. In the future, we will also implement some algorithms for data processing. We hope that we can learn from each other, help one another, and achieve better versions of ourselves together.
    Downloads: 0 This Week
  • 17
    TomoJ

    ImageJ plugin to perform Electron Tomography

    ...Sorzano et al. https://doi.org/10.1016/j.yjsbx.2020.100037. BMC Bioinformatics. 2009 Apr 27;10:124. "Marker-free image registration of electron tomography tilt-series." C.O.S. Sorzano et al. The reconstruction part was described in: BMC Bioinformatics. 2007 Aug 6;8:288. "TomoJ: tomography software for three-dimensional reconstruction in transmission electron microscopy." Messaoudi C et al.
    Downloads: 10 This Week
  • 18
    ToxTrac 2026

    Free Animal Tracking Software

    ToxTrac is a free Windows program optimized for tracking animals. It uses an advanced tracking algorithm and includes preprocessing, background subtraction, advanced collision and occlusion management, post-processing, and filters. It is robust, very fast, and can handle one or several animals in one or several environments. The program provides useful statistics as output. ToxTrac can be used for fish, insects, rodents, etc. If used, please cite: Rodriguez, A., Molares-Ulloa, A.,...
    Downloads: 182 This Week
  • 19
    dashAI

    dashAI: an interactive platform for training, evaluating and deploying

    dashAI is an open-source, no-code workbench for exploratory data analysis and classical ML. It offers visual data preparation, multi-model experiments, XAI explainability, and a plugin-based extensible catalog. The platform guides users through a complete, traceable workflow (data ingestion → visual exploration → preprocessing → model training → evaluation → explainability) without writing a single line of code. Each step is explicit and reversible, keeping the user in control rather than...
    Downloads: 7 This Week
  • 20
    openSkyMatch

    Matches OpenScience Observatories images with astronomical catalogs

    openSkyMatch is a collection of Linux shell and Python scripts designed for the OpenScience Observatories program. It automates the identification and matching of detected celestial objects in locally captured FITS images with entries in large-scale sky catalogs, notably Pan-STARRS1 DR2 (II/389/ps1_dr2). The toolkit supports data preprocessing, coordinate correlation, and catalog-based validation of astronomical detections. All tools are open-source and optimized for reproducibility and...
    Downloads: 0 This Week
  • 21
    Glint Translator
    Glint Translator is a high-performance Windows application for real-time in-game and voice translation without interrupting gameplay. It supports 240+ languages using DeepL, Google, OpenAI, Azure, and Google Gemini models. The interface is available in 18 languages. Features • 3 Translation Modes: Fluent (parallel), Area (overlay), Full Screen (smart detection) • Speaker detection with color-coding • Glint AI custom terminology control • Game-based profile system • Advanced...
    Downloads: 36 This Week
  • 22
    Pigo

    Fast face detection, pupil/eyes localization

    Pigo is a pure Go library for face detection, pupil/eyes localization, and facial landmark points detection, based on the Pixel Intensity Comparison-based Object detection paper. Pigo was developed because almost all of the existing solutions for face detection in the Go ecosystem are purely bindings to some C/C++ libraries like OpenCV or dlib, but calling a C...
    Downloads: 2 This Week
  • 23
    CCTV Frame Timestamp Extractor

    CCTV Footage Timestamp Search Tool

    ...Link to paper: https://link.springer.com/chapter/10.1007/978-3-031-10078-9_8 The project is divided into four modules: Framextract.py extracts frames from video footage; Reconstruct.py attempts to repair unplayable video by extracting the frames; framestitch.py attempts to construct a video from frames extracted from unplayable video; OCR.py performs image preprocessing and OCR on the extracted frames.
    Downloads: 0 This Week
  • 24
    SwiftOCR

    Fast and simple OCR library written in Swift

    SwiftOCR is a fast and simple OCR library written in Swift. It uses a neural network for image recognition. As of now, SwiftOCR is optimized for recognizing short, one-line alphanumeric codes (e.g. DI4C9CM). We currently support iOS and OS X. If you want to recognize normal text like a poem or a news article, go with Tesseract, but if you want to recognize short, alphanumeric codes (e.g. gift cards), I would advise you to choose SwiftOCR because that's where it excels. Tesseract is...
    Downloads: 0 This Week
  • 25
    VGGFace2

    VGGFace2 Dataset for Face Recognition

    ...These models achieve strong verification performance on benchmarks such as IJB-B and include variants with lower-dimensional embeddings for compact feature representation. The project also includes preprocessing tools, face detection scripts, and more.
    Downloads: 11 This Week