Search Results for "python image editor" - Page 3


Showing 1296 open source projects for "python image editor"

1

ComfyUI

The most powerful and modular diffusion model GUI, api and backend

The most powerful and modular diffusion model GUI and backend. This UI lets you design and execute advanced Stable Diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating on and improving ComfyUI, supporting the ComfyUI ecosystem with tools like the node manager, node registry, CLI, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...

Downloads: 218 This Week

Last Update: 2 days ago
2

SAM 3D Body

Code for running inference with the SAM 3D Body Model 3DB

SAM 3D Body is a promptable model for single-image full-body 3D human mesh recovery, designed to estimate detailed human pose and shape from just one RGB image. It reconstructs the full body, including feet and hands, using the Momentum Human Rig (MHR), a parametric mesh representation that decouples skeletal structure from surface shape for more accurate and interpretable results.

Downloads: 5 This Week

Last Update: 2 days ago
3

HunyuanWorld 1.0

Generating Immersive, Explorable, and Interactive 3D Worlds

HunyuanWorld-1.0 is an open-source, simulation-capable 3D world generation model developed by Tencent Hunyuan that creates immersive, explorable, and interactive 3D environments from text or image inputs. It combines the strengths of video-based diversity and 3D-based geometric consistency through a novel framework using panoramic world proxies and semantically layered 3D mesh representations. This approach enables 360° immersive experiences, seamless mesh export for graphics pipelines, and...

Downloads: 12 This Week

Last Update: 2025-10-22
4

Screenshot to Code

A neural network that transforms a design mock-up into static websites

Screenshot-to-code is a prototype tool that attempts to convert UI screenshots (e.g., of mobile or web UIs) into code representations, likely generating layouts, HTML, CSS, or markup from image inputs. It belongs to the research/proof-of-concept domain of UI automation and image-to-UI code generation. It covers mapping visual design to code constructs, producing code for UI layouts (HTML, CSS, or markup), and example/demo scripts showing the image-to-UI-code flow.

Downloads: 0 This Week

Last Update: 2025-09-26
5

CogView4

CogView4, CogView3-Plus and CogView3 (ECCV 2024)

CogView4 is the latest generation in the CogView series of vision-language foundation models, developed as a bilingual (Chinese and English) open-source system for high-quality image understanding and generation. Built on top of the GLM framework, it supports multimodal tasks including text-to-image synthesis, image captioning, and visual reasoning. Compared to previous CogView versions, CogView4 introduces architectural upgrades, improved training pipelines, and larger-scale datasets,...

Downloads: 3 This Week

Last Update: 2 days ago
6

Pixelorama

A free & open-source 2D sprite editor, made with the Godot Engine

Pixelorama is a free and open-source pixel art editor, proudly created with the Godot Engine, by Orama Interactive. Whether you want to make animated pixel art, game graphics, tiles, or any other kind of pixel art, Pixelorama has you covered with its variety of tools and features. Free to use for everyone, forever. A variety of different tools help you draw, and you can map a different tool to each of the left and right mouse buttons. Are you an animator? Pixelorama has its own...

Downloads: 77 This Week

Last Update: 2025-10-31
7

SAM 3D Objects

Models for object and human mesh reconstruction

SAM 3D Objects is a foundation model that reconstructs full 3D geometry, texture, and spatial layout of objects and scenes from a single image. Given one RGB image and object masks (for example, from the Segment Anything family), it can generate a textured 3D mesh for each object, including pose and approximate scene layout. The model is specifically designed to be robust in real-world images with clutter, occlusions, small objects, and unusual viewpoints, where many earlier 3D-from-image...

Downloads: 1 This Week

Last Update: 2 days ago
8

TexText

Re-editable LaTeX/typst graphics for Inkscape

TexText is a Python extension for the vector graphics editor Inkscape that lets you add LaTeX- and typst-generated SVG elements to your drawing and re-edit them later.

Downloads: 2 This Week

Last Update: 2025-07-17
9

Typer

Typer, build great CLIs, based on Python type hints

Typer is a library for building CLI applications that users will love using and developers will love creating. Based on Python 3.6+ type hints. Great editor support. Completion everywhere. Less time debugging. Designed to be easy to use and learn. Less time reading docs. It's easy to use for the final users. Automatic help, and automatic completion for all shells. Minimize code duplication. Multiple features from each parameter declaration. Fewer bugs. The simplest example adds only 2 lines of code to your app: 1 import, 1 function call. ...

Downloads: 1 This Week

Last Update: 2025-10-20
10

FastAPI

FastAPI framework, high performance, easy to learn, fast to code

FastAPI is a modern, fast (high-performance) web framework for building APIs with Python 3.6+ based on standard Python type hints. Great editor support. Completion everywhere. Less time debugging. Designed to be easy to use and learn. Less time reading docs. Minimize code duplication. Multiple features from each parameter declaration. Fewer bugs. Get production-ready code. With automatic interactive documentation. Based on (and fully compatible with) the open standards for APIs: OpenAPI (previously known as Swagger) and JSON Schema. ...

Downloads: 42 This Week

Last Update: 3 days ago
11

Wan2.1

Wan2.1: Open and Advanced Large-Scale Video Generative Model

Wan2.1 is a foundational open-source large-scale video generative model developed by the Wan team, providing high-quality video generation from text and images. It employs advanced diffusion-based architectures to produce coherent, temporally consistent videos with realistic motion and visual fidelity. Wan2.1 focuses on efficient video synthesis while maintaining rich semantic and aesthetic detail, enabling applications in content creation, entertainment, and research. The model supports...

1 Review

Downloads: 31 This Week

Last Update: 2025-09-23
12

MGIE

Guiding Instruction-based Image Editing via Multimodal Large Language

MGIE—Guiding Instruction-based Image Editing—demonstrates how a multimodal LLM can parse natural-language editing instructions and then drive image transformations accordingly. The project focuses on making edits explainable and controllable: the model interprets text guidance, reasons over image content, and outputs edits aligned with user intent. It’s positioned as an ICLR 2024 Spotlight work, with code and references that show how to connect language planning to concrete image operations....

Downloads: 0 This Week

Last Update: 2025-10-08
13

CLIP

CLIP, Predict the most relevant text snippet given an image

CLIP (Contrastive Language-Image Pretraining) is a neural model that links images and text in a shared embedding space, allowing zero-shot image classification, similarity search, and multimodal alignment. It was trained on large sets of (image, caption) pairs using a contrastive objective: images and their matching text are pulled together in embedding space, while mismatches are pushed apart. Once trained, you can give it any text labels and ask it to pick which label best matches a given...

Downloads: 0 This Week

Last Update: 2025-10-02
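
The zero-shot step described for CLIP above can be illustrated with a minimal sketch. It assumes the image and the candidate text labels have already been embedded into the same vector space; the three-dimensional vectors, the label strings, and the function names below are made up for illustration and do not come from the CLIP codebase. The temperature value is likewise an assumption.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(image_embedding, label_embeddings, temperature=0.07):
    """Pick the label whose embedding is most similar to the image embedding.

    `temperature` scales the similarities before the softmax; the default
    here is an illustrative assumption.
    """
    labels = list(label_embeddings)
    scaled = [cosine(image_embedding, label_embeddings[label]) / temperature
              for label in labels]
    probs = softmax(scaled)
    best = max(range(len(labels)), key=probs.__getitem__)
    return labels[best], dict(zip(labels, probs))


# Hypothetical toy embeddings standing in for real encoder outputs.
TOY_LABELS = {
    "a photo of a cat": [0.9, 0.1, 0.0],
    "a photo of a dog": [0.1, 0.9, 0.0],
    "a photo of a car": [0.0, 0.1, 0.9],
}
toy_image = [0.8, 0.2, 0.1]

best_label, probabilities = zero_shot_classify(toy_image, TOY_LABELS)
print(best_label)
```

In a real setup the embeddings would come from the model's image and text encoders; only the comparison step is shown here.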
14

YAPF

A formatter for Python files

YAPF is a Python code formatter that automatically rewrites source to match a chosen style, using a clang-format–inspired algorithm to search for the “best” layout under your rules. Instead of relying on a fixed set of heuristics, it explores formatting decisions and chooses the lowest-cost result, aiming to produce code a human would write when following a style guide. You can run it as a command-line tool or call it as a library via FormatCode / FormatFile, making it easy to embed in...

Downloads: 8 This Week

Last Update: 2 days ago
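
The idea of choosing the lowest-cost layout, as described for YAPF above, can be shown with a toy sketch. This is not YAPF's actual algorithm or API: the penalty constants, the two candidate layouts, and all function names are hypothetical and chosen only to show how a cost function can pick between formatting alternatives.

```python
# Hypothetical style rules for the toy example.
COLUMN_LIMIT = 20
LINE_BREAK_PENALTY = 10   # cost of each extra line
OVERFLOW_PENALTY = 100    # cost of each character past the column limit


def layout_cost(lines, column_limit=COLUMN_LIMIT):
    """Cost of a layout: line breaks plus characters over the column limit."""
    breaks = len(lines) - 1
    overflow = sum(max(0, len(line) - column_limit) for line in lines)
    return breaks * LINE_BREAK_PENALTY + overflow * OVERFLOW_PENALTY


def candidate_layouts(name, args):
    """Two ways to lay out a call: all on one line, or one argument per line."""
    one_line = [f"{name}({', '.join(args)})"]
    one_per_line = [f"{name}("] + [f"    {arg}," for arg in args] + [")"]
    return [one_line, one_per_line]


def best_layout(name, args, column_limit=COLUMN_LIMIT):
    """Return the candidate layout with the lowest cost."""
    return min(candidate_layouts(name, args),
               key=lambda lines: layout_cost(lines, column_limit))


short_call = best_layout("f", ["a", "b"])
long_call = best_layout("resize_image", ["source_path", "width", "height"])
print("\n".join(short_call))
print("\n".join(long_call))
```

The short call fits within the limit and stays on one line, while the long call is cheaper when split, because overflow is penalized more heavily than line breaks.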
15

CleanVision

Automatically find issues in image datasets

CleanVision automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc. This data-centric AI package is a quick first step for any computer vision project to find problems in the dataset, which you want to address before applying machine learning. CleanVision is super simple -- run the same couple lines of Python code to audit any image dataset!

Downloads: 1 This Week

Last Update: 2024-02-13
16

Albumentations

Fast image augmentation library and an easy-to-use wrapper

Albumentations is a computer vision tool that boosts the performance of deep convolutional neural networks. Albumentations is a Python library for fast and flexible image augmentations. Albumentations efficiently implements a rich variety of image transform operations that are optimized for performance, and does so while providing a concise, yet powerful image augmentation interface for different computer vision tasks, including object classification, segmentation, and detection. ...

Downloads: 1 This Week

Last Update: 2025-05-27
17

GraalPy

A Python 3 implementation built on GraalVM

GraalPy is a high-performance implementation of the Python language for the JVM built on GraalVM. GraalPy is a Python 3.11 compliant runtime. It has first-class support for embedding in Java and can turn Python applications into fast, standalone binaries. GraalPy is ready for production running pure Python code and has experimental support for many popular native extension modules.

Downloads: 3 This Week

Last Update: 2025-10-21
18

Django Filer

File and Image Management Application for django

Django Filer (django-filer) is a file management application for Django that makes handling files and images a breeze. It handles uploading and organizing files and images in contrib.admin. Custom model fields are provided for use in 3rd party apps as a replacement for the default FileField from Django. Behind the scenes a ForeignKey to the File model is used. It is possible to define the important part of an image (the subject location) in the...

Downloads: 0 This Week

Last Update: 2025-11-07
19

Kornia

Open Source Differentiable Computer Vision Library

Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend, both for efficiency and to take advantage of reverse-mode auto-differentiation to define and compute the gradients of complex functions. Inspired by existing packages, this library is composed of a subset of packages containing operators that can be inserted within...

Downloads: 0 This Week

Last Update: 2025-11-08
20

HunyuanVideo-I2V

A Customizable Image-to-Video Model based on HunyuanVideo

HunyuanVideo-I2V is a customizable image-to-video generation framework from Tencent Hunyuan, built on their HunyuanVideo foundation. It extends video generation so that given a static reference image plus an optional prompt, it generates a video sequence that preserves the reference image’s identity (especially in the first frame) and allows stylized effects via LoRA adapters. The repository includes pretrained weights, inference and sampling scripts, training code for LoRA effects, and...

Downloads: 1 This Week

Last Update: 2025-09-23
21

Mozc

Mozc - a Japanese Input Method Editor designed for multi-platform

Mozc is an open source Japanese Input Method Editor (IME) developed by Google, designed to provide Japanese text input across multiple operating systems including Android, macOS, Windows, GNU/Linux, and Chromium OS. The project originated as a subset of Google Japanese Input, released publicly under the BSD 3-Clause license for community use and development. Mozc offers core IME functionality such as text conversion, prediction, and dictionary-based input, enabling users to efficiently type...

Downloads: 8 This Week

Last Update: 18 hours ago
22

URH (Universal Radio Hacker)

Universal Radio Hacker: Investigate Wireless Protocols Like A Boss

Universal Radio Hacker (URH) is a tool for investigating unknown wireless communication protocols. It supports signal capturing, decoding, modulation analysis, and protocol reverse engineering through an intuitive graphical interface. URH is used in research, security testing, and hobbyist electronics for understanding proprietary RF systems, IoT device communication, and embedded protocols.

Downloads: 29 This Week

Last Update: 2025-07-01
23

pdfly

CLI tool to extract (meta)data from PDF and manipulate PDF files

A Python library designed for manipulating PDF files with functionalities for extraction, transformation, and document generation.

Downloads: 3 This Week

Last Update: 2025-10-13
24

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo

CogVideo is an open source text-/image-/video-to-video generation project that hosts the CogVideoX family of diffusion-transformer models and end-to-end tooling. The repo includes SAT and Diffusers implementations, turnkey demos, and fine-tuning pipelines (including LoRA) designed to run across a wide range of NVIDIA GPUs, from desktop cards (e.g., RTX 3060) to data-center hardware (A100/H100). Current releases cover CogVideoX-2B, CogVideoX-5B, and the upgraded CogVideoX1.5-5B variants, plus...

Downloads: 6 This Week

Last Update: 2025-10-04
25

DeepSeek VL

Towards Real-World Vision-Language Understanding

DeepSeek-VL is DeepSeek’s initial vision-language model that anchors their multimodal stack. It enables understanding and generation across visual and textual modalities—meaning it can process an image + a prompt, answer questions about images, caption, classify, or reason about visuals in context. The model is likely used internally as the visual encoder backbone for agent use cases, to ground perception in downstream tasks (e.g. answering questions about a screenshot). The repository...

Downloads: 1 This Week

Last Update: 2025-10-03