image processing in java free download

scikit-image

Image processing in Python

scikit-image is a collection of algorithms for image processing. It is available free of charge and free of restriction. We pride ourselves on high-quality, peer-reviewed code, written by an active community of volunteers. scikit-image builds on scipy.ndimage to provide a versatile set of image processing routines in Python. This library is developed by its community, and contributions are most welcome!

Downloads: 1 This Week

Last Update: 2025-12-20

See Project

Deep-Live-Cam

Real time face swap and one-click video deepfake

Real time face swap and one-click video deepfake with only a single image. Choose a face (image with the desired face) and the target image/video (image/video in which you want to replace the face) and click on Start. Open File Explorer and navigate to the directory you select your output to be in. You will find a directory named <video_title> where you can see the frames being swapped in real time. Once the processing is done, it will create the output file.

1 Review

Downloads: 406 This Week

Last Update: 2026-05-17

See Project

AUTOMATIC1111 Stable Diffusion web UI

Stable Diffusion web UI

AUTOMATIC1111's stable-diffusion-webui is a powerful, user-friendly web interface built on the Gradio library that allows users to easily interact with Stable Diffusion models for AI-powered image generation. Supporting both text-to-image (txt2img) and image-to-image (img2img) generation, this open-source UI offers a rich feature set including inpainting, outpainting, attention control, and multiple advanced upscaling options. With a flexible installation process across Windows, Linux, and Apple Silicon, plus support for GPUs and CPUs, it caters to a wide range of users—from hobbyists to professionals. ...

1 Review

Downloads: 198 This Week

Last Update: 2025-06-02

See Project

FaceFusion

Industry leading face manipulation platform

FaceFusion is an open-source face swapping and facial enhancement toolkit designed for high-quality video and image manipulation workflows. The project enables users to replace faces in images or videos while maintaining temporal consistency and visual realism. It integrates modern deep learning models for face detection, alignment, and blending to produce smoother results than traditional approaches. FaceFusion is built with a modular pipeline that allows users to customize processing steps and optimize performance for different hardware environments. ...

Downloads: 536 This Week

Last Update: 11 hours ago

See Project

Kornia

Open Source Differentiable Computer Vision Library

...Inspired by existing packages, this library is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors. With Kornia we fill the gap between classical and deep computer vision that implements standard and advanced vision algorithms for AI. Our libraries and initiatives are always according to the community needs.

Downloads: 0 This Week

Last Update: 2026-05-19

See Project

SD.Next

All-in-one WebUI for AI generative image and video creation

SD.Next is an all-in-one web user interface for generative image creation that expands beyond basic Stable Diffusion workflows to cover broader image and video generation, captioning, and processing tasks. It is designed as a power-user environment where model management, generation features, and workflow controls are centralized in a single UI rather than spread across separate scripts and utilities.

Downloads: 15 This Week

Last Update: 2026-07-14

See Project

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files

OCRmyPDF adds an optical character recognition (OCR) text layer to scanned PDF files, allowing them to be searched. PDF is the best format for storing and exchanging scanned documents. Unfortunately, PDFs can be difficult to modify. OCRmyPDF makes it easy to apply image processing and OCR (recognized, searchable text) to existing PDFs.

Downloads: 111 This Week

Last Update: 2026-07-17

See Project

Dream Textures

Stable Diffusion built-in to Blender

...Outpaint to increase the size of an image by extending it in any direction. Perform style transfer and create novel animations with Stable Diffusion as a post processing step. Dream Textures has been tested with CUDA and Apple Silicon GPUs. Over 4GB of VRAM is recommended.

Downloads: 3 This Week

Last Update: 2024-08-26

See Project

Sygil WebUI

Stable Diffusion web UI

Sygil WebUI is a browser-based interface for running Stable Diffusion image generation locally or on a server, wrapping common text-to-image and image-to-image workflows into a practical UI. It provides multiple UI modes (including a legacy Gradio interface) and focuses on making iterative prompting, parameter tuning, and post-processing accessible without writing code. The UI exposes core generation controls like resolution, CFG guidance, sampling steps, samplers, seeds, and batch generation so users can reproduce results and refine outputs systematically. ...

Downloads: 0 This Week

Last Update: 2026-07-17

See Project

SwarmUI

Modular AI image and video generation web UI with extensible tools

SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface. SwarmUI is built to accommodate both beginners and advanced users by offering a simple “Generate” interface alongside more...

Downloads: 11 This Week

Last Update: 2026-03-18

See Project

Milvus Bootcamp

Dealing with all unstructured data, such as reverse image search

Milvus Bootcamp is a collection of tutorials, examples, and best practices for using Milvus, an open-source vector database designed for AI-powered similarity search and retrieval applications.

Downloads: 0 This Week

Last Update: 2025-05-22

See Project

reverse-SynthID

Reverse engineering Gemini's SynthID detection

Reverse-SynthID is a research-focused project that analyzes and reverse-engineers Google’s SynthID watermarking system used in AI-generated images. It leverages signal processing and spectral analysis techniques to identify hidden watermark patterns without access to proprietary encoding methods. The project introduces a multi-resolution “SpectralCodebook” that maps watermark characteristics across different image sizes. Using this approach, it can detect SynthID watermarks with high accuracy and selectively reduce or remove them through frequency-domain manipulation. ...

Downloads: 6 This Week

Last Update: 2026-04-23

See Project

Keras Hub

Pretrained model hub for Keras 3

Keras Hub is a repository of pre-trained models for Keras 3, offering a collection of ready-to-use models for various machine-learning tasks. KerasHub is an extension of the core Keras API; KerasHub components are provided as Layer and Model implementations. If you are familiar with Keras, congratulations. You already understand most of KerasHub.

Downloads: 1 This Week

Last Update: 2026-07-24

See Project

Depth Anything 3

Recovering the Visual Space from Any Views

Depth Anything 3 is a research-driven project that brings accurate and dense depth estimation to any input image or video, enabling foundational understanding of 3D structure from 2D visual content. Designed to work across diverse scenes, lighting conditions, and image types, it uses advanced neural networks trained on large, heterogeneous datasets, producing depth maps that reveal scene depth relationships and object surfaces with strong fidelity.

Downloads: 5 This Week

Last Update: 4 days ago

See Project

Unstructured.IO

Open source libraries and APIs to build custom preprocessing pipelines

The unstructured library provides open-source components for ingesting and pre-processing images and text documents, such as PDFs, HTML, Word docs, and many more. The use cases of unstructured revolve around streamlining and optimizing the data processing workflow for LLMs. unstructured modular bricks and connectors form a cohesive system that simplifies data ingestion and pre-processing, making it adaptable to different platforms and is efficient in transforming unstructured data into...

Downloads: 0 This Week

Last Update: 2026-07-11

See Project

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system

...It allows developers to compute embeddings for both images and text efficiently and then index them for fast similarity search across massive datasets. The system is optimized for performance and scalability, capable of processing tens or even hundreds of millions of embeddings using GPU acceleration. It includes components for inference, indexing, filtering, and serving results through APIs, making it a complete pipeline for building production-ready retrieval systems. The framework also supports querying by image, text, or embedding, enabling flexible use cases such as reverse image search or multimodal content discovery. ...

Downloads: 0 This Week

Last Update: 2026-03-18

See Project

Spring AI Alibaba Examples

Spring AI Alibaba examples for building and testing AI apps

...It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. The examples highlight how to integrate AI models, manage prompts, handle memory, and build multi-model or multi-agent workflows. Developers can explore individual project folders for detailed instructions and implementation guidance. Spring AI Alibaba Examples also supports experimentation through playground modules and encourages contributions to expand real-world AI use cases and improve development practices.

1 Review

Downloads: 1 This Week

Last Update: 4 days ago

See Project

DeepSeek-OCR

Contexts Optical Compression

...It is designed to extract text from images, PDFs, and scanned documents, and integrates with multimodal capabilities that understand layout, context, and visual elements beyond raw character recognition. The system treats OCR not simply as “read the text” but as “understand what the text is doing in the image”—for example distinguishing captions from body text, interpreting tables, or recognizing handwritten versus printed words. It supports local deployment, enabling organizations concerned about privacy or latency to run the pipeline on-premises rather than send sensitive documents to third-party cloud services. The codebase is written in Python with a focus on modularity: you can swap preprocessing, recognition, and post-processing components as needed for custom workflows.

Downloads: 4 This Week

Last Update: 2026-01-27

See Project

PaddleNLP

Easy-to-use and powerful NLP library with Awesome model zoo

PaddleNLP It is a natural language processing development library for flying paddles, with Easy-to-use text area API, Examples of applications for multiple scenarios, and High-performance distributed training Three major features, aimed at improving the modeling efficiency of the flying oar developer's text field, aiming to improve the developer's development efficiency in the text field, and provide rich examples of NLP applications. Provide rich industry-level pre-task capabilities...

Downloads: 0 This Week

Last Update: 2025-05-21

See Project

Bonsai 27B

Run Bonsai (1-bit) and Ternary-Bonsai language models locally

Bonsai 27B is a repository for downloading, configuring, and running PrismML’s highly compressed Bonsai language models on local hardware. It supports the 1-bit Bonsai and higher-quality Ternary-Bonsai families in 1.7B, 4B, 8B, and 27B sizes. The models can run on macOS, Linux, and Windows through CPU, Metal, CUDA, Vulkan, ROCm, llama.cpp, or MLX backends. Its 27B models process text, images, screenshots, and PDFs while supporting reasoning and long-context conversations. They also provide...

Downloads: 166 This Week

Last Update: 1 day ago

See Project

HunyuanDiT

Diffusion Transformer with Fine-Grained Chinese Understanding

HunyuanDiT is a high-capability text-to-image diffusion transformer with bilingual (Chinese/English) understanding and multi-turn dialogue capability. It trains a diffusion model in latent space using a transformer backbone and integrates a Multimodal Large Language Model (MLLM) to refine captions and support conversational image generation. It supports adapters like ControlNet, IP-Adapter, LoRA, and can run under constrained VRAM via distillation versions. LoRA, ControlNet (pose, depth,...

Downloads: 1 This Week

Last Update: 2025-11-27

See Project

POT

Python Optimal Transport

This open source Python library provides several solvers for optimization problems related to Optimal Transport for signal, image processing and machine learning.

Downloads: 2 This Week

Last Update: 2 days ago

See Project

Depth Pro

Sharp Monocular Metric Depth in Less Than a Second

Depth Pro is a foundation model for zero-shot metric monocular depth estimation, producing sharp, high-frequency depth maps with absolute scale from a single image. Unlike many prior approaches, it does not require camera intrinsics or extra metadata, yet still outputs metric depth suitable for downstream 3D tasks. Apple highlights both accuracy and speed: the model can synthesize a ~2.25-megapixel depth map in around 0.3 seconds on a standard GPU, enabling near real-time applications. The...

Downloads: 8 This Week

Last Update: 2025-10-08

See Project

HivisionIDPhoto

HivisionIDPhotos: a lightweight and efficient AI ID photos tools

...It also allows the generation of layout sheets such as six-inch photo arrangements for printing multiple ID photos on a single page. The project focuses on building a practical pipeline for automated ID photo production using AI-based segmentation and image processing techniques.

Downloads: 4 This Week

Last Update: 2026-03-10

See Project

ModelScope

Bring the notion of Model-as-a-Service to life

ModelScope is built upon the notion of “Model-as-a-Service” (MaaS). It seeks to bring together most advanced machine learning models from the AI community, and streamlines the process of leveraging AI models in real-world applications. The core ModelScope library open-sourced in this repository provides the interfaces and implementations that allow developers to perform model inference, training and evaluation. In particular, with rich layers of API abstraction, the ModelScope library offers...

Downloads: 4 This Week

Last Update: 2026-07-22

See Project

Search Results for "image processing in java"

Showing 77 open source projects for "image processing in java"

scikit-image

Deep-Live-Cam

AUTOMATIC1111 Stable Diffusion web UI

FaceFusion

Kornia

SD.Next

OCRmyPDF

Dream Textures

Sygil WebUI

SwarmUI

Milvus Bootcamp

reverse-SynthID

Keras Hub

Depth Anything 3

Unstructured.IO

clip-retrieval

Spring AI Alibaba Examples

DeepSeek-OCR

PaddleNLP

Bonsai 27B

HunyuanDiT

POT

Depth Pro

HivisionIDPhoto

ModelScope

Search Results for "image processing in java"

Showing 77 open source projects for "image processing in java"

scikit-image

Deep-Live-Cam

AUTOMATIC1111 Stable Diffusion web UI

FaceFusion

Kornia

SD.Next

OCRmyPDF

Dream Textures

Sygil WebUI

SwarmUI

Milvus Bootcamp

reverse-SynthID

Keras Hub

Depth Anything 3

Unstructured.IO

clip-retrieval

Spring AI Alibaba Examples

DeepSeek-OCR

PaddleNLP

Bonsai 27B

HunyuanDiT

POT

Depth Pro

HivisionIDPhoto

ModelScope

Related Searches

Related Categories