Showing 324 open source projects for "images"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Error to trace to log to deploy. One click. No SSH. Icon
    Error to trace to log to deploy. One click. No SSH.

    Catch the cause before the pager goes off.

    AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
    Free 30 days.
  • 1
    Dynamic Routing Between Capsules

    Dynamic Routing Between Capsules

    A PyTorch implementation of the NIPS 2017 paper

    Dynamic Routing Between Capsules is a PyTorch implementation of the Capsule Network architecture originally proposed to address limitations in traditional convolutional neural networks. Capsule networks aim to improve how neural models represent spatial hierarchies and relationships between objects within images. Instead of scalar neuron activations, capsules output vectors that encode both the presence of features and their spatial properties such as orientation or pose. The repository implements the dynamic routing algorithm between capsules, which allows lower-level features to route their outputs to higher-level structures that best represent the detected patterns. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    LUMINOTH

    LUMINOTH

    Deep Learning toolkit for Computer Vision

    ...The framework is implemented in Python and built on top of TensorFlow and the Sonnet neural network library, providing a modular environment for training and deploying detection models. It was created to simplify the process of building and experimenting with deep learning models capable of identifying objects within images. Luminoth includes support for popular object detection architectures such as Faster R-CNN and SSD, enabling developers to train models on datasets like COCO and Pascal VOC. The toolkit provides command-line utilities for dataset management, training, and inference, making it easier to integrate into research workflows and production systems. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    SG2Im

    SG2Im

    Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201

    sg2im is a research codebase that learns to synthesize images from scene graphs—structured descriptions of objects and their relationships. Instead of conditioning on free-form text alone, it leverages graph structure to control layout and interactions, generating scenes that respect constraints like “person left of dog” or “cup on table.” The pipeline typically predicts object layouts (bounding boxes and masks) from the graph, then renders a realistic image conditioned on those layouts. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    TensorImage

    Image classification library for easily training and deploying models

    ...TensorImage is easy to use and manage as all files, trained models and data are organized within a workspace directory, which you can change at any time in the configuration file, therefore being able have an indefinite amount of workspace directories for different purposes. Moreover, TensorImage can also be used to classify on thousands of images with trained image classification models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    Exposure

    Exposure

    Learning infinite-resolution image processing with GAN and RL

    ...ACM Transactions on Graphics (presented at SIGGRAPH 2018) Exposure is originally designed for RAW photos, which assumes 12+ bit color depth and linear "RGB" color space (or whatever we get after demosaicing). jpg and png images typically have only 8-bit color depth (except 16-bit pngs) and the lack of information (dynamic range/activation resolution) may lead to suboptimal results such as posterization. Moreover, jpg and most pngs assume an sRGB color space, which contains a roughly 1/2.2 Gamma correction, making the data distribution different from training images (which are linear). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Generative Models

    Generative Models

    Collection of generative models, e.g. GAN, VAE in Pytorch

    ...The repository contains practical implementations of well-known architectures such as Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs), Restricted Boltzmann Machines, and Helmholtz Machines, implemented primarily using modern deep learning frameworks like PyTorch and TensorFlow. These models are widely used in artificial intelligence to generate new data that resembles the training data, such as images, text, or other structured outputs. The repository serves as an educational and experimental environment where users can study how generative models work internally and replicate results from academic research papers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LearningToCompare_FSL

    LearningToCompare_FSL

    Learning to Compare: Relation Network for Few-Shot Learning

    LearningToCompare_FSL is a PyTorch implementation of the “Learning to Compare: Relation Network for Few-Shot Learning” paper, focusing on the few-shot learning experiments described in that work. The core idea implemented here is the relation network, which learns to compare pairs of feature embeddings and output relation scores that indicate whether two images belong to the same class, enabling classification from only a handful of labeled examples. The repository provides training and evaluation code for standard few-shot benchmarks such as miniImageNet and Omniglot, making it possible to reproduce the experimental results reported in the paper. It includes model definitions, data loading logic, episodic training loops, and scripts that implement the N-way K-shot evaluation protocol common in few-shot research. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Face Recognition

    World's simplest facial recognition api for Python & the command line

    ...Face Recognition is highly accurate and is able to do a number of things. It can find faces in pictures, manipulate facial features in pictures, identify faces in pictures, and do face recognition on a folder of images from the command line. It could even do real-time face recognition and blur faces on videos when used with other Python libraries.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    EvalAI

    EvalAI

    Evaluating state of the art in AI

    ...If the challenge needs extra computational power, challenge organizers can easily add their own cluster of worker nodes to process participant submissions while we take care of hosting the challenge, handling user submissions, and maintaining the leaderboard. EvalAI lets participants submit code for their agent in the form of docker images which are evaluated against test environments on the evaluation server. During the evaluation, the worker fetches the image, test environment, and model snapshot and spins up a new container to perform the evaluation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 10
    DIGITS

    DIGITS

    Deep Learning GPU training system

    The NVIDIA Deep Learning GPU Training System (DIGITS) puts the power of deep learning into the hands of engineers and data scientists. DIGITS can be used to rapidly train the highly accurate deep neural network (DNNs) for image classification, segmentation and object detection tasks. DIGITS simplifies common deep learning tasks such as managing data, designing and training neural networks on multi-GPU systems, monitoring performance in real-time with advanced visualizations, and selecting...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FastPhotoStyle

    FastPhotoStyle

    Style transfer, deep learning, feature transform

    ...The framework is particularly useful in applications such as photo editing, film post-processing, and dataset augmentation where realism is critical. By preserving structural details and avoiding distortions, it produces results that are visually consistent with natural images.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Siamese and triplet learning

    Siamese and triplet learning

    Siamese and triplet networks with online triplet mining in PyTorch

    Siamese and triplet learning is a PyTorch implementation of Siamese and triplet neural network architectures designed for learning embedding representations in machine learning tasks. These types of networks learn to map images into a compact feature space where the distance between vectors reflects the similarity between inputs. Such embeddings are commonly used in applications like face recognition, image similarity search, and few-shot learning. The repository demonstrates how to train these models using contrastive loss and triplet loss functions, which encourage embeddings of similar samples to be close while pushing dissimilar samples farther apart. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Image classification models for Keras

    Image classification models for Keras

    Keras code and weights files for popular deep learning models

    All architectures are compatible with both TensorFlow and Theano, and upon instantiation the models will be built according to the image dimension ordering set in your Keras configuration file at ~/.keras/keras.json. For instance, if you have set image_dim_ordering=tf, then any model loaded from this repository will get built according to the TensorFlow dimension ordering convention, "Width-Height-Depth". Pre-trained weights can be automatically loaded upon instantiation (weights='imagenet'...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    House3D

    House3D

    A Realistic and Rich 3D Environment

    ...Each environment includes fully labeled 3D objects, allowing agents to perceive and interact with their surroundings through multiple sensory modalities including RGB images, depth maps, semantic segmentation masks, and top-down maps. The simulator is optimized for high-performance rendering, achieving thousands of frames per second to enable efficient large-scale training of RL agents. House3D has served as the foundation for several influential research projects such as RoomNav (for concept-based navigation) and Embodied Question Answering (EQA).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Deepo

    Deepo

    Set up deep learning environment in a single command line

    ...For users in China who may suffer from slow speeds when pulling the image from the public Docker registry, you can pull deepo images from the China registry mirror by specifying the full path, including the registry, in your docker pull command. This should work and enables Deepo to use the GPU from inside a docker container.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Five video classification methods

    Five video classification methods

    Code that accompanies my blog post outlining five video classification

    Classifying video presents unique challenges for machine learning models. As I’ve covered in my previous posts, video has the added (and interesting) property of temporal features in addition to the spatial features present in 2D images. While this additional information provides us more to work with, it also requires different network architectures and, often, adds larger memory and computational demands.We won’t use any optical flow images. This reduces model complexity, training time, and a whole whack load of hyperparameters we don’t have to worry about. Every video will be subsampled down to 40 frames. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Training Image Operators from Samples

    Tools to train Image Operators automatically from a set of samples.

    TRIOS - Training Image Operators from Samples is a set of tools to bring Image Processing closer to scientists in general. It is capable of estimating an operator between two images using only pairs of samples that contain an input image and the desired output. The operator is saved to a file and can be applied to any image.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Neural Photo Editor

    Neural Photo Editor

    A simple interface for editing natural photos

    ...The project implements the system described in the research paper Neural Photo Editing with Introspective Adversarial Networks, which introduces a generative model capable of modifying images in semantically meaningful ways. Instead of editing images by directly manipulating pixels, the software allows users to influence changes in the latent space of a trained generative model. This approach enables large and coherent modifications to images while preserving visual realism. The system relies on an Introspective Adversarial Network, a hybrid architecture combining elements of variational autoencoders and generative adversarial networks to improve reconstruction accuracy and generative quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Convolution arithmetic

    Convolution arithmetic

    A technical report on convolution arithmetic in deep learning

    A technical report on convolution arithmetic in the context of deep learning. The code and the images of this tutorial are free to use as regulated by the licence and subject to proper attribution. The animations will be output to the gif directory. Individual animation steps will be output in PDF format to the pdf directory and in PNG format to the png directory. We introduce a guide to help deep learning practitioners understand and manipulate convolutional neural network architectures. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    mullpy

    Multilabel-learning library built on python

    Mullpy is a machine-learning library that mainly aim to solve multi-label problems. It is classifier independent, has many ensemble capabilities (diversity methods like bagging, random subspaces, etc.) and automated results presentation (Excel, images as ROC or class-separated info, etc.). It is fully configurable. At the moment supports Neural Networks and classifiers defined in files. It is working on python3.3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Ivolution

    Ivolution

    Timelapse creation using Face Recognition

    Ivolution is a face timelapse generator. Feed it with a bunch of images and it will generate a movie with your face centered on the screen. Ivolution uses face detection and modifies the images so that your face always keeps the same size and location over the movie. Images are processed in chronological order, so that you can see your face evoluate over time !
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    This project consists in a set of challenges to recognize images acquired from 3d Lasers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    OCR Reader

    The tool supports template-based parsing, allowing structured output i

    OCR Reader is a lightweight Windows utility designed to extract text from PDF files and images using OCR (Tesseract engine). The tool supports template-based parsing, allowing structured output into CSV or TXT without manual coding. Core components Tesseract OCR engine Poppler (PDF rendering) Template-based extraction system Homepage: https://martan1484.github.io/OCR_Reader
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    scripthea

    Scripthea is designed to streamline of crafting prompts for T2I gen.

    ...This modular approach allows users to experiment with various combinations, facilitating a more systematic exploration of visual styles and themes. Why Scripthea? - Systematically explore various artistic styles and themes - Efficiently manage and review large batches of generated images. - Gain deeper insights into the relationship between prompts and visual outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo