scale image free download

Showing 225 open source projects for "scale image"

1

Image Harmonization Dataset iHarmony4

The first large-scale public benchmark dataset for image harmonization

This repository provides the iHarmony4 dataset, which is a large-scale dataset designed for image harmonization tasks. Image harmonization involves adjusting the appearance of a foreground in a composite image so that it is consistent with the background (in color, tone, illumination, etc.). The iHarmony4 dataset comprises four sub-datasets (HCOCO, HAdobe5k, HFlickr, Hday2night), each making composite images by combining a foreground from one image with a background from another, along with associated ground truth harmonized images and foreground masks. ...

Downloads: 0 This Week

Last Update: 2026-02-24
See Project
2

Z-Image

Image generation model with single-stream diffusion transformer

Z-Image is an efficient, open-source image generation foundation model built to make high-quality image synthesis more accessible. With just 6 billion parameters — far fewer than many large-scale models — it uses a novel “single-stream diffusion Transformer” architecture to deliver photorealistic image generation, demonstrating that excellence does not always require extremely large model sizes.

Downloads: 22 This Week

Last Update: 2026-02-09
See Project
3

Exclusively Dark Image Dataset

ExDARK dataset is one of the largest collections of low-light images

The Exclusively Dark (ExDARK) dataset is one of the largest curated collections of real-world low-light images designed to support research in computer vision tasks under challenging lighting conditions. It contains 7,363 images captured across ten different low-light scenarios, ranging from extremely dark environments to twilight. Each image is annotated with both image-level labels and object-level bounding boxes for 12 object categories, making it suitable for detection and classification tasks. The dataset was created to address the lack of large-scale low-light datasets available for research in object detection, recognition, and enhancement. It has been widely used in studies of low-light image enhancement, deep learning approaches, and domain adaptation for vision models. ...

Downloads: 8 This Week

Last Update: 4 days ago
See Project
4

Wan2.1

Wan2.1: Open and Advanced Large-Scale Video Generative Model

...The model supports text-to-video and image-to-video generation tasks with flexible resolution options suitable for various GPU hardware configurations. Wan2.1’s architecture balances generation quality and inference cost, paving the way for later improvements seen in Wan2.2 such as Mixture-of-Experts and enhanced aesthetics. It was trained on large-scale video and image datasets, providing generalization across diverse scenes and motion patterns.

1 Review

Downloads: 59 This Week

Last Update: 2026-03-05
See Project
5

Point Cloud Library

A standalone, large-scale, open project for 2D/3D image processing

The Point Cloud Library (PCL) is a standalone, large-scale, open project for 2D/3D image and point cloud processing. PCL is released under the terms of the BSD license and is thus free for commercial and research use. Whether you’ve just discovered PCL or you’re a longtime veteran, this page contains links to a set of resources that will help consolidate your knowledge of PCL and 3D processing.

Downloads: 19 This Week

Last Update: 2025-08-27
See Project
6

Pix

Image management application

Pix is an image management application with image viewing, browsing, organizing and editing capabilities. It is part of the X-Apps project, which aims at producing cross-distribution and cross-desktop software. Pix supports numerous image types including: BMP, JPEG, GIF, PNG, TIFF, TGA, ICO and XPM; with optional support for RAW and HDR (high dynamic range) images. It is also able to view EXIF data attached to JPEG images. Pix has its own set of image editing tools that enable you to...

Downloads: 21 This Week

Last Update: 2026-01-08
See Project
7

simpleParallax.js

Simple and tiny JavaScript library that adds parallax animations

simpleParallax.js is a simple, tiny vanilla JS library that adds parallax animations to any image. Where getting results through other plugins can be laborious, simpleParallax.js stands out for its ease of use and its visual rendering. The parallax effect is applied directly to image tags, so there is no need to use background images.

Downloads: 20 This Week

Last Update: 6 days ago
See Project
8

pix2pixHD

Synthesizing and manipulating 2048x1024 images with conditional GANs

pix2pixHD is a PyTorch-based implementation of a conditional generative adversarial network designed for high-resolution image-to-image translation, capable of producing photorealistic outputs at resolutions up to 2048×1024. It is widely used to convert structured inputs such as semantic label maps into realistic images, making it particularly valuable in applications like autonomous driving simulation, face synthesis, and scene generation. The model improves upon earlier GAN approaches by introducing multi-scale generators and discriminators that enable stable training and fine detail generation at large resolutions. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
9

CogView4

CogView4, CogView3-Plus and CogView3 (ECCV 2024)

CogView4 is the latest generation in the CogView series of vision-language foundation models, developed as a bilingual (Chinese and English) open-source system for high-quality image understanding and generation. Built on top of the GLM framework, it supports multimodal tasks including text-to-image synthesis, image captioning, and visual reasoning. Compared to previous CogView versions, CogView4 introduces architectural upgrades, improved training pipelines, and larger-scale datasets, enabling stronger alignment between textual prompts and generated visual content. ...

Downloads: 4 This Week

Last Update: 4 days ago
See Project
10

Wan2.2

Wan2.2: Open and Advanced Large-Scale Video Generative Model

...The model is trained on significantly larger datasets than its predecessor, greatly enhancing motion complexity, semantic understanding, and aesthetic diversity. Wan2.2 also open-sources a 5-billion parameter high-compression VAE-based hybrid text-image-to-video (TI2V) model that supports 720P video generation at 24fps on consumer-grade GPUs like the RTX 4090. It supports multiple video generation tasks including text-to-video.

1 Review

Downloads: 84 This Week

Last Update: 2026-03-17
See Project
11

SwarmUI

Modular AI image and video generation web UI with extensible tools

SwarmUI is a modular web-based user interface designed for AI-driven image generation, with a strong focus on usability, performance, and extensibility. It serves as a unified environment for working with multiple AI models, including Stable Diffusion and newer image and video generation systems, allowing users to create and manage outputs through a browser interface. SwarmUI is built to accommodate both beginners and advanced users by offering a simple “Generate” interface alongside more...

Downloads: 14 This Week

Last Update: 2026-03-18
See Project
12

Picsur

An easy-to-use, self-hostable image sharing service like Imgur

Picsur is a lightweight, self-hosted image hosting service inspired by Imgur. Built using Go and Vue, it offers fast image uploads, user accounts, and a minimal UI. Picsur is designed for private or small-scale image sharing and gives users full control over their data and hosting environment.

Downloads: 0 This Week

Last Update: 2025-06-11
See Project
13

Easy Diffusion

An easy 1-click way to create beautiful artwork on your PC using AI

...It provides a browser-based user interface that runs locally, allowing users to type text prompts and immediately generate images directly within their web browser, democratizing access to powerful text-to-image models for artists and hobbyists alike. The project abstracts away environment setup, dependencies, and model installation (tasks that can be daunting to beginners) and instead lets users focus on creative experimentation with prompt phrasing, model parameters, and image output settings. Beyond easy installation and use, Easy Diffusion’s interface includes options for queuing multiple jobs, applying modifiers like upscaling or face correction, and adjusting generation parameters like guidance scale and resolution.

Downloads: 19 This Week

Last Update: 2026-03-31
See Project
14

Hunyuan3D 2.0

High-Resolution 3D Assets Generation with Large Scale Diffusion Models

The Hunyuan3D-2 model, developed by Tencent, is designed for generating high-resolution 3D assets using large-scale diffusion models. This model offers advanced capabilities for creating detailed 3D models, including texture enhancements, multi-view shape generation, and rapid inference for real-time applications. It is particularly useful for industries requiring high-quality 3D content, such as gaming, film, and virtual reality. Hunyuan3D-2 supports various enhancements and is available...

Downloads: 33 This Week

Last Update: 2025-10-28
See Project
15

gallery-dl

Command-line program to download image galleries and collections

...With its broad site compatibility and flexible configuration system, it is widely used for automating large-scale gallery downloads.

Downloads: 34 This Week

Last Update: 6 days ago
See Project
16

HunyuanImage-3.0

A Powerful Native Multimodal Model for Image Generation

HunyuanImage-3.0 is a powerful, native multimodal text-to-image generation model released by Tencent’s Hunyuan team. It unifies multimodal understanding and generation in a single autoregressive framework, combining text and image modalities seamlessly rather than relying on separate image-only diffusion components. It uses a Mixture-of-Experts (MoE) architecture with many expert subnetworks to scale efficiently, activating only a subset of experts per token, which allows large parameter counts without a proportional increase in inference cost. ...

1 Review

Downloads: 7 This Week

Last Update: 2026-02-03
See Project
17

Final2x

2^x Image Super-Resolution

The tool is available for Windows x64/arm64, macOS x64/arm64, and Linux x64, allowing users to enjoy the benefits of super-resolution regardless of their operating system. It offers a wide range of models that can be used to achieve different levels of super-resolution, allowing users to choose the one that best suits their specific needs. Users have the flexibility to specify the desired output size for their images, ranging from small enhancements to large-scale super-resolution. The tool...

Downloads: 19 This Week

Last Update: 2025-10-05
See Project
18

ImageBind

ImageBind: One Embedding Space to Bind Them All

ImageBind is a multimodal embedding framework that learns a shared representation space across six modalities: images, text, audio, depth, thermal, and inertial measurement unit (IMU) data. It does so without requiring explicit pairwise training for every modality combination. Instead of aligning each pair independently, ImageBind uses image data as the central binding modality, aligning all other modalities to it so they can interoperate zero-shot. This creates a unified embedding space where representations from any modality can be compared or retrieved against any other (e.g., matching sound to text or depth to image). The model is trained using large-scale contrastive learning, leveraging diverse datasets of natural images, videos, audio clips, and sensor data. ...

Downloads: 0 This Week

Last Update: 2025-11-21
See Project
19

uCrop

Image cropping library for Android

We develop lots of different Android apps at Yalantis, and our experience shows that almost every application we deal with needs image cropping functionality. Image cropping can be used for various purposes, from ordinary adjustment of user profile images to more complex features that involve aspect ratio cropping and flexible image transformations. Since we want to provide all our customers with the best set of tools for image editing functionality, we decided to create uCrop, an image...

Downloads: 2 This Week

Last Update: 2025-08-04
See Project
20

Spegel

Stateless cluster local OCI registry mirror.

Spegel is a distributed container image registry mirror designed to speed up container image pulls in large-scale Kubernetes clusters. It locally mirrors container images to cluster nodes, reducing latency and bandwidth consumption during container deployments. Spegel integrates natively with containerd and CRI-O, ensuring seamless operation in container runtimes without changing workflows.

Downloads: 3 This Week

Last Update: 2026-06-18
See Project
21

infinite-canvas

Infinite Canvas Workbench for AI creation integrates AI generation

infinite-canvas is an open-source visual workspace for AI-assisted image creation and iterative creative planning. It combines canvas organization, image generation, reference image editing, chat assistance, prompt libraries, and asset management in one interface. Users can work across multiple canvases, drag and scale nodes, connect ideas visually, use a minimap, undo changes, and import or export work.

Downloads: 0 This Week

Last Update: 2026-06-16
See Project
22

ViMax

Director, Screenwriter, Producer, and Video Generator All-in-One

ViMax is an open-source framework for performing large-scale multi-modal vision-language modeling and reasoning by combining powerful image encoders with advanced language models to solve complex visual tasks. It integrates components like visual encoders, cross-modal fusion techniques, and reasoning modules so that users can go beyond simple captioning or classification to perform tasks such as visual question answering, multi-image inference, and structured scene understanding. ...

Downloads: 1 This Week

Last Update: 2026-06-08
See Project
23

ML Sharp

Sharp Monocular View Synthesis in Less Than a Second

ML Sharp is a research code release that turns a single 2D photograph into a photorealistic 3D representation that can be rendered from nearby viewpoints. Instead of requiring multi-view input, it predicts the parameters of a 3D Gaussian scene representation directly from one image using a single forward pass through a neural network. The core idea is speed: the 3D representation is produced in under a second on a standard GPU, and then the resulting scene can be rendered in real time to generate new views interactively. The representation is metric, meaning it supports camera movements with an absolute scale rather than only relative depth cues, which is useful for consistent viewpoint changes and downstream spatial tasks. ...

Downloads: 2 This Week

Last Update: 2026-01-29
See Project
24

OpenShot Video Editor

Award-Winning Open Source Video Editing Software

OpenShot Video Editor is a powerful yet very simple and easy-to-use video editor that delivers high-quality video editing and animation solutions. OpenShot offers a myriad of features and capabilities, including powerful curve-based keyframe animations, 3D animated titles and effects, slow motion and time effects, audio mixing and editing, and so much more. It’s available for Linux, Mac and Windows, with a very simple and friendly interface. Start creating stunning videos quickly and easily...

6 Reviews

Downloads: 86 This Week

Last Update: 2026-04-08
See Project
25

Perception Models

State-of-the-art Image & Video CLIP, Multimodal Large Language Models

...The project supports a wide range of research applications, from visual recognition and dense prediction to fine-grained multimodal understanding. Additionally, it includes several large-scale open datasets for both image and video perception.

Downloads: 2 This Week

Last Update: 6 days ago
See Project