Open Source Computer Vision Libraries - Page 8

  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    The Java parallel to the popular Intel computer vision library, OpenCV. OpenJCV = Open Java Computer Vision
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenTLD

    OpenTLD

    OpenTLD is an open source library for real-time 2D tracking

    OpenTLD is an open source implementation of the TLD (Tracking-Learning-Detection) framework, designed for real-time 2D tracking of a single object in video sequences. Because it fuses tracking and detection, TLD can recover from occlusions, drift, or failures by using its detection mechanism to reacquire the object. In terms of usage, one typically initializes the tracker by providing a bounding box on the first frame, then calls a function like run_TLD to process a video and obtain bounding boxes over time. The system updates its internal models as frames are processed, and can re-detect the target when tracking fails. The algorithm’s performance is known to improve over time due to its online adaptation behavior. Because of its age and MATLAB dependencies, adopting it in modern C++ / real-time pipelines may require effort (e.g. rewriting or porting) or using more recent tracking libraries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    The Tensor Voting Framework is a powerful technique for perceptual grouping, manifold learning, etc. It has proved to be a useful tool in the Computer Vision community. OpenTVF is an open source implementation of TVF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Overhead Imagery Research Data Set (OIRDS) - an annotated data library & tools to aid in the development of computer vision algorithms
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Papier-Mâché is a toolkit for building tangible user interfaces that employ computer vision, RFID tags, and/or bar-codes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Proposed is an algorithm that uses computer vision, combined with a modified Rubine classifier, to allow arbitrary N-sided polygons as accepted sketches in real-time.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Portable Robotics Eye Vergence Control

    Portable Robotics Eye Vergence Control

    Eye movements control portable on different robotic stereo heads

    This project provides a software module for the control of the binocular coordination of a robotic stereo head, based on a bio-inspired algorithm. The project is now available for the iCub platform to work on YARP [https://github.com/stino78/vergence-control/][1] The algorithm works on the top of a distributed representation of binocular disparity supplied by a population of binocular energy-model neural units. The project allows a robust control and adaptive binocular coordination for different robot stereo platforms. Reference publications: Gibaldi, A., Vanegas, M., Canessa, A., & Sabatini, S. P. (2017). A portable bio-inspired architecture for efficient robotic vergence control. International Journal of Computer Vision,. Gibaldi, A., Canessa, A., Chessa, M., Sabatini, S. P., & Solari, F. (2011, October). A neuromorphic control module for real-time vergence eye movements on the iCub robot head. In Humanoid Robots (Humanoids), 2011
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    ProximityForest

    Efficient Approximate Nearest Neighbors for General Metric Spaces

    A proximity forest is a data structure that allows for efficient computation of approximate nearest neighbors of arbitrary data elements in a metric space. See: O'Hara and Draper, "Are You Using the Right Approximate Nearest Neighbor Algorithm?", WACV 2013 (best student paper award). One application of a ProximityForest is given in the following CVPR publication: Stephen O'Hara and Bruce A. Draper, "Scalable Action Recognition with a Subspace Forest," IEEE Conference on Computer Vision and Pattern Recognition, 2012. This source code is provided without warranty and is available under the GPL license. More commercially-friendly licenses may be available. Please contact Stephen O'Hara for license options. Please view the wiki on this site for installation instructions and examples on reproducing the results of the papers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    PyArmadillo

    PyArmadillo

    linear algebra library for Python

    PyArmadillo - streamlined linear algebra library for Python, with emphasis on ease of use. Alternative to NumPy / SciPy. * Main page: https://pyarma.sourceforge.io * Documentation: https://pyarma.sourceforge.io/docs.html * Bug reports: https://pyarma.sourceforge.io/faq.html * Git repo: https://gitlab.com/jason-rumengan/pyarma
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 10
    PyCls

    PyCls

    Codebase for Image Classification Research, written in PyTorch

    pycls is a focused PyTorch codebase for image classification research that emphasizes reproducibility and strong, transparent baselines. It popularized families like RegNet and supports classic architectures (ResNet, ResNeXt) with clean implementations and consistent training recipes. The repository includes highly tuned schedules, augmentations, and regularization settings that make it straightforward to match reported accuracy without guesswork. Distributed training and mixed precision are first-class, enabling fast experiments on multi-GPU setups with simple, declarative configs. Model definitions are concise and modular, making it easy to prototype new blocks or swap backbones while keeping the rest of the pipeline unchanged. Pretrained weights and evaluation scripts cover common datasets, and the logging/metric stack is designed for quick comparison across runs. Practitioners use pycls both as a baseline factory and as a scaffold for new classification backbones.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PyTorch SimCLR

    PyTorch SimCLR

    PyTorch implementation of SimCLR: A Simple Framework

    For quite some time now, we know about the benefits of transfer learning in Computer Vision (CV) applications. Nowadays, pre-trained Deep Convolution Neural Networks (DCNNs) are the first go-to pre-solutions to learn a new task. These large models are trained on huge supervised corpora, like the ImageNet. And most important, their features are known to adapt well to new problems. This is particularly interesting when annotated training data is scarce. In situations like this, we take the models’ pre-trained weights, append a new classifier layer on top of it, and retrain the network. This is called transfer learning, and is one of the most used techniques in CV. Aside from a few tricks when performing fine-tuning (if the case), it has been shown (many times) that if training for a new task, models initialized with pre-trained weights tend to learn faster and be more accurate then training from scratch using random initialization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    The Python Computer Vision Framework is an opened project deisgned for all those interested in computer vision. It aims at making computer vision more easy and structured and matlab-free. It may also be used for other artistic and scientific areas.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    QVision: Computer Vision Library for Qt

    QVision: Computer Vision Library for Qt

    Computer vision and image processing library for Qt.

    This library contains among other things a set of graphical widgets for video output, performance evaluation and augmented reality. The library also provides classes for several data types usually required by computer vision and image processing applications such as vectors, matrices, quaternions and images. Thanks to a large number of wrapper functions these objects can be used with highly efficient functionality from third party libraries such as OpenCV, GNU Scientific Library, Computational Geometry Algorithms Library, Intel's Math Kernel Library and Integrated Performance Primitives, the Octave library, etc...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    R-FCN

    R-FCN

    R-FCN: Object Detection via Region-based Fully Convolutional Networks

    R-FCN (“Region-based Fully Convolutional Networks”) is an object detection framework that makes almost all computation fully convolutional and shared across the image, unlike prior region-based approaches (e.g. Faster R-CNN) which run per-region sub-networks. The repository provides an implementation (in Python) supporting end-to-end training and inference of R-FCN models on standard datasets. The authors propose position-sensitive score maps to reconcile the need for translation variance (in detection) and translation invariance (in classification). R-FCN is efficient (low per-region overhead) and competitive in accuracy (e.g. with ResNet backbones). Position-sensitive score maps for per-region classification without expensive per-region convs. Optional “deformable R-FCN” extension for improved performance.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    R1-V

    R1-V

    Witness the aha moment of VLM with less than $3

    R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Raster Vision

    Raster Vision

    Open source framework for deep learning satellite and aerial imagery

    Raster Vision is an open source framework for Python developers building computer vision models on satellite, aerial, and other large imagery sets (including oblique drone imagery). There is built-in support for chip classification, object detection, and semantic segmentation using PyTorch. Raster Vision allows engineers to quickly and repeatably configure pipelines that go through core components of a machine learning workflow: analyzing training data, creating training chips, training models, creating predictions, evaluating models, and bundling the model files and configuration for easy deployment. The input to a Raster Vision pipeline is a set of images and training data, optionally with Areas of Interest (AOIs) that describe where the images are labeled. The output of a Raster Vision pipeline is a model bundle that allows you to easily utilize models in various deployment scenarios.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Rcnn

    Rcnn

    R-CNN: Regions with Convolutional Neural Network Features

    This repository contains the original MATLAB implementation of R-CNN (Regions with Convolutional Neural Networks), a pioneering deep learning-based object detection framework. Developed by Ross Girshick, R-CNN combines region proposals with convolutional neural networks to detect objects in images. It was one of the first approaches to significantly improve performance on object detection benchmarks like PASCAL VOC.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Computer vision video game controller for the Call of Duty game series.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The Microsoft Kinect is known to be an affordable commercial sensor. As an introduction to computer vision, we developed a small program to show how it can be used in machine learning, more specifically in the field of object recognition using support vector machines (SVM). Demo: http://www.youtube.com/watch?v=DDdShv1pQTg
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Facis is a computer vision project based on OpenCV. It enables face detection and identification.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    RoboRealm Extensions are plugins to the RoboRealm application that allow you to extend RoboRealm in whatever way you need. RoboRealm is a powerful computer vision based application for use in machine vision, image analysis, and image processing systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Robogathers

    Robogathers

    A simple opensource 3d network game

    Robogathers is a 3d game, developed mainly for academic purposes to teach future software engineers with 3d graphic processing, mathematical modelling, networking and multithreading. In the far future we also want to use this project as a base for scientific research in fields of visual odometry, simultaneous localization and mapping and computer vision. The game consist in driving your robot gathering good prizes and avoiding bad ones. The game is multiplayer and during it you can compete with other people via network as well as with AI robots.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SAHI

    SAHI

    A lightweight vision library for performing large object detection

    A lightweight vision library for performing large-scale object detection & instance segmentation. Object detection and instance segmentation are by far the most important fields of applications in Computer Vision. However, detection of small objects and inference on large images are still major issues in practical usage. Here comes the SAHI to help developers overcome these real-world problems with many vision utilities. Detection of small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by small number of pixels in the image and lack sufficient details, making them difficult to detect using conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing aided inference and fine-tuning pipeline for small object detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Low-power approximate adders provide basic building blocks for approximate computing hardware that have shown remarkable energy efficiency for error-resilient applications (like image/video processing, computer vision, etc.), especially for battery-driven portable systems. In this paper, we present a novel scalable, fast yet accurate analytical method to evaluate the output error probability of multi-bit low power adders for a predetermined probability of input bits. Our method recursively computes the error probability by considering the accurate cases only, which are considerably smaller than the erroneous ones. Our method can handle the error analysis of a wider-range of adders with negligible computational overhead. To ensure its rapid adoption in industry and academia, we have open-sourced our LabVIEW and MATLAB libraries. Lab Web Page: http://save.seecs.nust.edu.pk/projects/SEALPAA/ Emails: 14mseemayub@seecs.edu.pk, osman.hasan@seecs.edu.pk, muhammad.shafique@tuwien.ac.at
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SOD

    SOD

    An Embedded Computer Vision & Machine Learning Library

    SOD is an embedded, modern cross-platform computer vision and machine learning software library that expose a set of APIs for deep-learning, advanced media analysis & processing including real-time, multi-class object detection and model training on embedded systems with limited computational resource and IoT devices. SOD was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in open source as well as commercial products. SOD implements state-of-the-art computer vision algorithms found to be mandatory in real-world application areas. Sobel operator, Otsu's binarization and over 100 image/frame processing & analysis interfaces. Designed for computational efficiency and with a strong focus on real-time applications. SOD includes a comprehensive set of both classic and state-of-the-art deep-neural networks with their pre-trained models.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB