Showing 27 open source projects for "image manipulation"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    Qwen-Image

    Qwen-Image

    Qwen-Image is a powerful image generation foundation model

    ...The model excels not only in text rendering but also in a wide range of artistic styles, including photorealistic, impressionist, anime, and minimalist aesthetics. Qwen-Image supports sophisticated editing tasks such as style transfer, object insertion and removal, detail enhancement, and even human pose manipulation, making it suitable for both professional and casual users. It also includes advanced image understanding capabilities like object detection, semantic segmentation, depth and edge estimation, and novel view synthesis.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    FaceFusion

    FaceFusion

    Industry leading face manipulation platform

    FaceFusion is an open-source face swapping and facial enhancement toolkit designed for high-quality video and image manipulation workflows. The project enables users to replace faces in images or videos while maintaining temporal consistency and visual realism. It integrates modern deep learning models for face detection, alignment, and blending to produce smoother results than traditional approaches. FaceFusion is built with a modular pipeline that allows users to customize processing steps and optimize performance for different hardware environments. ...
    Downloads: 165 This Week
    Last Update:
    See Project
  • 3
    SciPy

    SciPy

    SciPy library main repository

    ...SciPy is an open source software used in the fields of mathematics, science, and engineering, with modules for statistics, optimization, integration, linear algebra, signal and image processing, and many more. The SciPy library contains many of the user-friendly and efficient numerical routines, including those for numerical integration, interpolation, and optimization. SciPy is built to work with NumPy, a software that provides convenient and fast N-dimensional array manipulation. Both SciPy and NumPy run on all popular operating systems, are fast and easy to install, and are powerful yet easy to use. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4
    IOPaint

    IOPaint

    Image inpainting tool powered by SOTA AI Model

    IOPaint is a powerful open-source image editing tool focused on inpainting, outpainting, object removal, and general image manipulation driven by state-of-the-art AI models, delivering these capabilities through both local and hosted workflows. Designed to be fully self-hosted and flexible, IOPaint supports a variety of underlying generators and inpaint models — from LaMa erase networks to Stable Diffusion-based replace/object generation — giving users multiple ways to refine or reconstruct images by removing unwanted elements or expanding artwork beyond its original boundaries. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    MuJoCo Playground

    MuJoCo Playground

    An open source library for GPU-accelerated robot learning

    MuJoCo Playground, developed by Google DeepMind, is a GPU-accelerated suite of simulation environments for robot learning and sim-to-real research, built on top of MuJoCo MJX. It unifies a range of control, locomotion, and manipulation tasks into a consistent and scalable framework optimized for JAX and Warp backends. The project includes classic control benchmarks from dm_control, advanced quadruped and bipedal locomotion systems, and dexterous as well as non-prehensile manipulation setups. It also offers optional vision-based training capabilities through integration with Madrona-MJX, allowing researchers to train policies directly from image input on GPUs. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    DreamO

    DreamO

    A Unified Framework for Image Customization

    DreamO is a unified, open-source framework from ByteDance for advanced image customization and generation that consolidates multiple “image manipulation” tasks into a single system, rather than requiring separate specialized models. Built on a diffusion-transformer (DiT) backbone, it supports a diverse set of tasks — including identity preservation, virtual “try-on” (e.g. clothing, accessories), style transfer, IP adaptation (objects/characters), and layout/condition-aware customizations — all handled within the same unified architecture. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    pikepdf

    pikepdf

    A Python library for reading and writing PDF, powered by QPDF

    pikepdf is a Python library allowing the creation, manipulation, and repair of PDFs. It provides a Pythonic wrapper around the C++ PDF content transformation library, QPDF. Python + QPDF = “py” + “qpdf” = “pyqpdf”, which looks like a dyslexia test and is no fun to type. But say “pyqpdf” out loud, and it sounds like “pikepdf”. pikepdf is a library intended for developers who want to create, manipulate, parse, repair, and abuse the PDF format. It supports reading and write PDFs, including...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    InternGPT

    InternGPT

    Open source demo platform where you can easily showcase your AI models

    InternGPT is an open-source multimodal AI framework designed to extend large language models beyond text interactions into visual reasoning and image manipulation tasks. The system integrates conversational AI with computer vision models so users can interact with images, videos, and visual environments through natural language instructions. Unlike traditional chat systems that rely solely on text prompts, InternGPT allows users to interact with visual content using both language and nonverbal signals such as pointing or highlighting objects within images. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Imaging Instruments Lite

    Imaging Instruments Lite

    Image processing App for Windows Desktop

    Imaging Instruments lite is a comprehensive image processing application developed following the Model-View-Controller (MVC) design pattern, utilizing Python, Tkinter, and OpenCV. It provides users with image manipulation capabilities, leveraging multi-threading with OpenMP and GPU acceleration using CUDA-C. Fueled by yerba mate and a passion for coding. Created by Agustin Tortolero.
    Downloads: 1 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    stmani3

    stmani3

    Stereo Photo Manipulation

    A set of programs for Alignment and Rendering of still Stereo Photos (3D). This is a Python3 updated version of the old StMani
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    pdf combiner merger converter splitter

    pdf combiner merger converter splitter

    PDF Combiner is a user-friendly, GUI-based tool built in

    PDF Combiner is a user-friendly open source free to use, GUI-based tool for combining, pdf to excel, pdf to word, image to pdf, zip, unzip annotate and splitting PDF files. It is easy to use, supports multiple file insert and delete and process, and allows you to adjust the order of files before combining.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    DragGAN

    DragGAN

    Official Code for DragGAN (SIGGRAPH 2023)

    DragGAN is a research-driven image editing system that enables precise manipulation of GAN-generated images through interactive point dragging. The project introduces a novel workflow where users move specific points in an image and the model intelligently deforms the content while preserving realism. Built on top of StyleGAN architectures, the tool operates directly on the learned generative manifold to maintain photorealistic consistency.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Glumpy

    Glumpy

    Python+Numpy+OpenGL, scalable and beautiful scientific visualization

    Glumpy is a Python library that simplifies the development of high-performance, interactive OpenGL visualizations. It abstracts complex OpenGL tasks into Pythonic constructs, making it easier for scientists, artists, and developers to harness the power of the GPU for real-time rendering and data visualization. Glumpy is particularly well-suited for rapid prototyping of graphical applications, and its integration with NumPy and shader programming makes it a powerful tool for both research and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    ALAE

    ALAE

    Adversarial Latent Autoencoders

    ALAE (Adversarial Latent Autoencoders) is a deep learning research implementation that combines autoencoders with generative adversarial networks to produce high-quality image synthesis models. The project implements the architecture introduced in the CVPR research paper on Adversarial Latent Autoencoders, which focuses on improving generative modeling by learning latent representations aligned with adversarial training objectives. Unlike traditional GANs that directly generate images from...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    GIMP ML

    GIMP ML

    AI for GNU Image Manipulation Program

    This repository introduces GIMP3-ML, a set of Python plugins for the widely popular GNU Image Manipulation Program (GIMP). It enables the use of recent advances in computer vision to the conventional image editing pipeline. Applications from deep learning such as monocular depth estimation, semantic segmentation, mask generative adversarial networks, image super-resolution, de-noising and coloring have been incorporated with GIMP through Python-based plugins. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    OpenAI Glow

    OpenAI Glow

    Copy code in "Glow: Generative Flow with Invertible 1x1 Convolutions"

    Glow is an open source generative model released by OpenAI that demonstrates flow-based generative modeling techniques. Unlike models that rely on approximate inference, Glow uses invertible transformations to directly learn the data distribution, allowing for exact likelihood computation and efficient sampling. The model is capable of producing high-quality synthetic images while maintaining interpretable latent spaces that enable meaningful manipulation of generated outputs. Glow’s...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    StarGAN

    StarGAN

    Official PyTorch Implementation

    StarGAN is an implementation of the Star Generative Adversarial Network, a model designed for multi-domain image-to-image translation using a single unified GAN architecture. Unlike earlier GAN approaches that required separate models for each domain pair, StarGAN enables flexible attribute transfer across multiple domains within one network, significantly improving efficiency and scalability. The repository includes full training and inference pipelines for tasks such as facial attribute manipulation and style transfer. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Pydicom by examples

    Pydicom by examples

    Basic and intermediate examples of DICOM library with Jupyter

    Basic and intermediate examples to read, modify and write DICOM files with Python code using Jupyter - To install Jupyter - https://jupyter.org/install ====== All examples are based on Pydicom. An open source library - https://pydicom.github.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Neural Photo Editor

    Neural Photo Editor

    A simple interface for editing natural photos

    Neural Photo Editor is an experimental machine learning application that demonstrates how generative neural networks can be used as an interactive photo editing tool. The project implements the system described in the research paper Neural Photo Editing with Introspective Adversarial Networks, which introduces a generative model capable of modifying images in semantically meaningful ways. Instead of editing images by directly manipulating pixels, the software allows users to influence...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    allink

    allink

    Software for data analysis, image processing, simulations, solver.

    ...DrImage) image manipulation on the Matematica filters. The program is intended to use as less as possible external libs (optional: openGL, gsl, fftw, cgal, png, tiff, boost, MPI, Qt...).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Tesseract-gui
    Tessract-GUI is not a front-end for tesseract-ocr. It is just a graphical way to use it with simple image manipulation thru ImageMagick.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    GIIAF Microscopy Library

    GIIAF Microscopy Library

    The GIIAF Microscopy Library, that uses customised OMERO software

    This project incorporates a suite of tools that aim to allow researchers within Griffith's Imaging and Image Analysis Facility (GIIAF) to efficiently and effectively provide secure, centralised, web-accessible data storage, management and manipulation. The open-source Java-based OMERO software was customised to provide most of the features of this project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Image Processing software for combining information from images of the same scene, but with illumination variations or small changes. This work is the result of the coursework of an Image Processing regular course at IMPA.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    UIMP(Unified Image Manipulation Program) is an OS independent Image Processing system, which provides over 1000 algorithms and supports web online, console and GUI program.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A Python library for manipulation of WAD files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB