Showing 25 open source projects for "pixels"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Atera - an All-in-one platform for IT management Icon
    Atera - an All-in-one platform for IT management

    Ideal for IT departments and MSPs (managed service providers)

    Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!
    Try Atera now
  • 1
    pixelmatch

    pixelmatch

    The smallest, simplest JavaScript pixel-level image comparison library

    The smallest, simplest and fastest JavaScript pixel-level image comparison library, originally created to compare screenshots in tests. Features accurate anti-aliased pixels detection and perceptual color difference metrics. Inspired by Resemble.js and Blink-diff. Unlike these libraries, pixelmatch is around 150 lines of code, has no dependencies, and works on raw typed arrays of image data, so it's blazing fast and can be used in any environment (Node or browsers). Compares two images, writes the output diff and returns the number of mismatched pixels.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    canvas-constructor

    canvas-constructor

    An ES6 utility for canvas with built-in functions and chained methods

    An ES6 utility for canvas with built-in functions and chained methods. Alternatively, you can import canvas-constructor/browser. That will create a canvas with size of 300 pixels width, 300 pixels height. Set the color to #AEFD54. Draw a rectangle with the previous color, covering all the pixels from (5, 5) to (290 + 5, 290 + 5) Set the color to #FFAE23. Set the font size to 28 pixels with font Impact. Write the text 'Hello World!' in the position (130, 150) Return a buffer.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Caire

    Caire

    Content aware image resize library

    ...Using a dynamic programming approach the algorithm will generate individual seams across the image from top to down, or from left to right (depending on the horizontal or vertical resizing) and will allocate for each seam a custom value, the least important pixels having the lowest energy cost and the most important ones having the highest cost. We traverse the image from the second row to the last row and compute the cumulative minimum energy for all possible connected seams for each entry. The minimum energy level is calculated by summing up the current pixel value with the lowest value of the neighboring pixels obtained from the previous row.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Sprite Fusion Pixel Snapper

    Sprite Fusion Pixel Snapper

    A tool to snap pixels to a perfect grid

    Sprite Fusion Pixel Snapper is a utility designed to eliminate sub-pixel rendering issues that often arise in pixel art, UI icons, and 2D sprite graphics when displayed on screens with high DPI or during motion animations. The tool works by adjusting sprite rendering coordinates and texture sampling so that every pixel aligns cleanly to the screen’s pixel grid, avoiding blurring, distortion, or unintended smoothing artifacts. This is especially important in pixel art games, retro-styled...
    Downloads: 5 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    VOID

    VOID

    Video Object and Interaction Deletion

    VOID is an advanced AI video processing system developed by Netflix that focuses on removing objects from videos while preserving the physical and visual realism of the surrounding environment. Unlike traditional inpainting methods that only erase pixels or simple artifacts, VOID models the full interaction dynamics between objects and their environment, including shadows, reflections, and even physical consequences such as movement or balance changes. Built on top of transformer-based architectures and fine-tuned for video inpainting tasks, the system uses interaction-aware mask conditioning to ensure temporal consistency across frames. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    SAHI

    SAHI

    A lightweight vision library for performing large object detection

    ...Here comes the SAHI to help developers overcome these real-world problems with many vision utilities. Detection of small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by small number of pixels in the image and lack sufficient details, making them difficult to detect using conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing aided inference and fine-tuning pipeline for small object detection.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    JEPA

    JEPA

    PyTorch code and models for V-JEPA self-supervised learning from video

    JEPA (Joint-Embedding Predictive Architecture) captures the idea of predicting missing high-level representations rather than reconstructing pixels, aiming for robust, scalable self-supervised learning. A context encoder ingests visible regions and predicts target embeddings for masked regions produced by a separate target encoder, avoiding low-level reconstruction losses that can overfit to texture. This makes learning focus on semantics and structure, yielding features that transfer well with simple linear probes and minimal fine-tuning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Color Thief

    Color Thief

    Grab the color palette from an image using just Javascript

    ...When run in Node, this argument expects a path to the image. quality is an optional argument that must be an Integer of value 1 or greater, and defaults to 10. The number determines how many pixels are skipped before the next one is sampled. We rarely need to sample every single pixel in the image to get good results. The bigger the number, the faster a value will be returned. Gets a palette from the image by clustering similar colors. The palette is returned as an array containing colors, each color itself an array of three integers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    vJEPA-2

    vJEPA-2

    PyTorch code and models for VJEPA2 self-supervised learning from video

    VJEPA2 is a next-generation self-supervised learning framework for video that extends the “predict in representation space” idea from i-JEPA to the temporal domain. Instead of reconstructing pixels, it predicts the missing high-level embeddings of masked space-time regions using a context encoder and a slowly updated target encoder. This objective encourages the model to learn semantics, motion, and long-range structure without the shortcuts that pixel-level losses can invite. The architecture is designed to scale: spatiotemporal ViT backbones, flexible masking schedules, and efficient sampling let it train on long clips while remaining stable. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    The Arcade Learning Environment

    The Arcade Learning Environment

    The Arcade Learning Environment (ALE) -- a platform for AI research

    ...This environment suite has been central to many RL breakthroughs, including value-based agents, deep Q-nets, and general-agent benchmarking, because the Atari games span many genres and present diverse learning challenges (pixels, actions, delayed rewards). The repository supports multi‐platform build (Linux, macOS, Windows), vectorized execution of games, Python bindings, Gymnasium registration, and a large set of game ROMs bundled for convenience. While its rendering may not match modern 3D environments, its importance lies in reproducibility, benchmarking, and the fact that many RL baselines and papers reference ALE.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Qwen-VL

    Qwen-VL

    Chat & pretrained large vision language model

    Qwen-VL is Alibaba Cloud’s vision-language large model family, designed to integrate visual and linguistic modalities. It accepts image inputs (with optional bounding boxes) and text, and produces text (and sometimes bounding boxes) as output. The model variants (VL-Plus, VL-Max, etc.) have been upgraded for better visual reasoning, text recognition from images, fine-grained understanding, and support for high image resolutions / extreme aspect ratios. Qwen-VL supports multilingual inputs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    cleanvideo-cli

    cleanvideo-cli

    CLI tool for removing watermarks from AI-generated videos using frame-

    cleanvideo-cli is a command-line tool designed to remove visible watermarks from AI-generated videos. It works by analyzing video frames and reconstructing the underlying pixels in watermark regions, without cropping or blurring the original content. This project is intended for developers, researchers, and creators who need a lightweight utility for cleaning preview or draft videos before further processing. Note: This tool does not bypass platform restrictions and should be used only on content you own or have the rights to use.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    FLUX.1 Krea

    FLUX.1 Krea

    Powerful open source image generation model

    ...It is a rectified-flow model distilled from the original Krea 1, providing enhanced sampling efficiency through classifier-free guidance distillation. The model supports generation at resolutions between 1024 and 1280 pixels with recommended inference steps between 28 and 32 for optimal balance of speed and quality. FLUX.1 Krea is fully compatible with the FLUX.1 architecture, making it easy to integrate into existing workflows and pipelines. The repository offers easy-to-use inference scripts and a Jupyter Notebook example to facilitate quick experimentation and adoption. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    PyDenseCRF

    PyDenseCRF

    Python wrapper to Philipp Krähenbühl's dense (fully connected) CRFs

    ...The project allows developers and researchers to integrate Dense CRF inference into Python-based machine learning pipelines, particularly for computer vision tasks such as image segmentation and labeling. Conditional Random Fields are probabilistic graphical models used to model contextual relationships between neighboring pixels or features, improving prediction consistency across images. By implementing a fully connected CRF model with Gaussian edge potentials, the library enables efficient inference across all pixel pairs in an image rather than only local neighborhoods. The Python wrapper is implemented using Cython, allowing high-performance CRF computations while maintaining a Python-friendly interface for experimentation and development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Aphantasia

    Aphantasia

    CLIP + FFT/DWT/RGB = text to image/video

    ...Generating massive detailed textures, a la deepdream, fullHD/4K resolutions and above, various CLIP models (including multi-language from SBERT), continuous mode to process phrase lists (e.g. illustrating lyrics), pan/zoom motion with smooth interpolation. Direct RGB pixels optimization (very stable) depth-based 3D look (courtesy of deKxi, based on AdaBins), complex queries: text and/or image as main prompts, separate text prompts for style and to subtract (avoid) topics. Starting/resuming process from saved parameters or from an image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Fashion-MNIST

    Fashion-MNIST

    A MNIST-like fashion product database

    ...It was designed as a direct replacement for the original MNIST handwritten digits dataset, maintaining the same structure and image size so that researchers could easily switch datasets without modifying their experimental pipelines. The dataset consists of 70,000 images in total, with 60,000 examples used for training and 10,000 reserved for testing. Each image has a resolution of 28 by 28 pixels and belongs to one of ten clothing classes, making it suitable for evaluating classification models. Because the dataset represents real-world objects rather than handwritten digits, it offers a more challenging benchmark for testing machine learning algorithms.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 17
    DensePose

    DensePose

    A real-time approach for mapping all human pixels of 2D RGB images

    ...DensePose is widely used in augmented reality, motion capture, virtual try-on, and visual effects applications because it enables real-time 3D human mapping from 2D inputs. The model architecture builds on Mask R-CNN, using additional regression heads to predict UV coordinates that map image pixels to 3D surfaces.
    Downloads: 164 This Week
    Last Update:
    See Project
  • 18
    Image GPT

    Image GPT

    Large-scale autoregressive pixel model for image generation by OpenAI

    Image-GPT is the official research code and models from OpenAI’s paper Generative Pretraining from Pixels. The project adapts GPT-2 to the image domain, showing that the same transformer architecture can model sequences of pixels without altering its fundamental structure. It provides scripts to download pretrained checkpoints of different model sizes (small, medium, large) trained on large-scale datasets and includes utilities for handling color quantization with a 9-bit palette. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 19
    PixelCNN

    PixelCNN

    Code for the paper "PixelCNN++: A PixelCNN Implementation..."

    PixelCNN is the official implementation from OpenAI of the autoregressive generative model described in the paper Conditional Image Generation with PixelCNN Decoders. It provides code for training and evaluating PixelCNN models on image datasets, focusing on conditional image modeling where pixels are generated sequentially based on the values of previously generated pixels. The repository demonstrates how to apply masked convolutions to enforce autoregressive dependencies and achieve tractable likelihood-based training. It also includes scripts for reproducing key experimental results from the paper, such as conditional sampling on datasets like CIFAR-10. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    imgaug

    imgaug

    Image augmentation for machine learning experiments

    ...Affine transformations, perspective transformations, contrast changes, gaussian noise, dropout of regions, hue/saturation changes, cropping/padding, blurring, etc. Rotate image and segmentation map on it by the same value sampled. Convert keypoints to distance maps, extract pixels within bounding boxes from images, clip polygon to the image plane, etc. Scale segmentation maps, average/max pool of images/maps, pad images to aspect ratios (e.g. to square them). Draw heatmaps, segmentation maps, keypoints, bounding boxes, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PyTorch pretrained BigGAN

    PyTorch pretrained BigGAN

    PyTorch implementation of BigGAN with pretrained weights

    An op-for-op PyTorch reimplementation of DeepMind's BigGAN model with the pre-trained weights from DeepMind. This repository contains an op-for-op PyTorch reimplementation of DeepMind's BigGAN that was released with the paper Large Scale GAN Training for High Fidelity Natural Image Synthesis. This PyTorch implementation of BigGAN is provided with the pretrained 128x128, 256x256 and 512x512 models by DeepMind. We also provide the scripts used to download and convert these models from the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Neural Photo Editor

    Neural Photo Editor

    A simple interface for editing natural photos

    ...The project implements the system described in the research paper Neural Photo Editing with Introspective Adversarial Networks, which introduces a generative model capable of modifying images in semantically meaningful ways. Instead of editing images by directly manipulating pixels, the software allows users to influence changes in the latent space of a trained generative model. This approach enables large and coherent modifications to images while preserving visual realism. The system relies on an Introspective Adversarial Network, a hybrid architecture combining elements of variational autoencoders and generative adversarial networks to improve reconstruction accuracy and generative quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Universe

    Universe

    Software for measuring and training an AI's general intelligence

    ...It does this by packaging the program into a Docker container, and presenting the AI with the same interface a human uses: sending keyboard and mouse events, and receiving screen pixels. Our initial release contains over 1,000 environments in which an AI agent can take actions and gather observations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    tracking.js

    tracking.js

    A modern approach for Computer Vision on the web

    The tracking.js library brings different computer vision algorithms and techniques into the browser environment. By using modern HTML5 specifications, we enable you to do real-time color tracking, face detection and much more, all that with a lightweight core (~7 KB) and intuitive interface. To get started, download the project. This project includes all of the tracking.js examples, source code dependencies you'll need to get started. Unzip the project somewhere on your local drive. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    BayesianCortex

    BayesianCortex

    simple algorithm for a realtime interactive visual cortex for painting

    A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by changing what you painted gradually. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 childs because that is all thats needed to do NAND in a fuzzy way as Bayes' Rule which is NAND at certain extremes. NAND can be...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next