Open Source Computer Vision Libraries - Page 9

  • 1

    SURF-nanodots

    Very basic computer vision program

    This was my first big programming project. I'd like to clean it up and improve the back-end design when time permits. See the wiki for known issues and more information. A computer vision program for analyzing magnetic data collected by a scanning probe microscope. It originated in summer 2007 as a collection of C files compiled for MATLAB (MEX) and was eventually ported to a standalone C++ application with a GUI created in Qt. The program takes atomic and magnetic force microscope (AFM/MFM) image pairs as input and uses threshold segmentation to identify magnetic nanodots by intensity in the AFM image. These are then used to assess the magnetic states of those dots in the MFM image. Attribution: "C++ GUI Programming with Qt 4" by Blanchette and Summerfield was helpful in getting me started on the GUI.
    Downloads: 0 This Week
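The threshold segmentation step described for SURF-nanodots can be illustrated with a minimal sketch. This is not the project's code: the function names `segment_dots` and `mean_mfm_per_dot` are hypothetical, images are assumed to be nested lists of numbers, and using the mean MFM signal per dot as a stand-in for its "magnetic state" is an assumption made purely for illustration.

```python
from collections import deque

def segment_dots(afm, threshold):
    """Label 4-connected regions of AFM pixels brighter than `threshold`.

    Returns (labels, count), where labels[r][c] is 0 for background
    or a dot index in 1..count.
    """
    rows, cols = len(afm), len(afm[0])
    labels = [[0] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if afm[r][c] <= threshold or labels[r][c]:
                continue
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            # Breadth-first flood fill over the thresholded pixels.
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and afm[ny][nx] > threshold
                            and not labels[ny][nx]):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count

def mean_mfm_per_dot(labels, count, mfm):
    """Average the MFM signal over each labeled dot (illustrative assumption)."""
    sums = [0.0] * (count + 1)
    sizes = [0] * (count + 1)
    for label_row, mfm_row in zip(labels, mfm):
        for label, value in zip(label_row, mfm_row):
            if label:
                sums[label] += value
                sizes[label] += 1
    return {label: sums[label] / sizes[label] for label in range(1, count + 1)}
```

The dot mask comes only from the AFM image and is then reused to read the MFM image, which mirrors the AFM/MFM pairing the description mentions.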
  • 2

    Savant

    Python Computer Vision & Video Analytics Framework With Batteries Incl

    Savant is an open-source, high-level framework for building real-time, streaming, highly efficient multimedia AI applications on the Nvidia stack. It helps to develop dynamic, fault-tolerant inference pipelines that utilize the best Nvidia approaches for data center and edge accelerators. Savant is built on DeepStream and provides a high-level abstraction layer for building inference pipelines. It is designed to be easy to use, flexible, and scalable. It is a great choice for building smart CV and video analytics applications for cities, retail, manufacturing, and more.
    Downloads: 0 This Week
  • 3
    Scene
    Scene is a computer vision framework that performs background subtraction and object tracking, using two traditional algorithms and three more recent algorithms based on neural networks and fuzzy classification rules. For each detected object, Scene sends TUIO messages to one or several client applications. The present release features GPU accelerated versions of all the background subtraction methods and morphological post processing of the object blobs with dilation and erosion filters, implemented in OpenCL. The framework was mainly designed as a toolkit for the rapid development of interactive art projects that explore dynamics of complex environments. The Scene GUI runs and compiles under Windows, Linux, and MacOS X, and is available in both 32 bit and 64 bit versions.
    Downloads: 0 This Week
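To make the Scene description concrete, here is a rough sketch of background subtraction followed by morphological clean-up of the blobs. It is not Scene's OpenCL implementation: it assumes a fixed background image, a simple absolute-difference threshold, and a 3x3 structuring element, and the function names are hypothetical.

```python
def subtract_background(frame, background, threshold):
    """Return a binary mask: 1 where the frame differs from the background."""
    return [
        [1 if abs(pixel - bg) > threshold else 0 for pixel, bg in zip(frame_row, bg_row)]
        for frame_row, bg_row in zip(frame, background)
    ]

def _morph(mask, combine):
    """Apply `combine` (min or max) over each pixel's 3x3 neighborhood.

    The window is clipped at the image border rather than padded.
    """
    rows, cols = len(mask), len(mask[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                mask[y][x]
                for y in range(max(0, r - 1), min(rows, r + 2))
                for x in range(max(0, c - 1), min(cols, c + 2))
            ]
            out[r][c] = combine(window)
    return out

def erode(mask):
    """Keep a pixel only if its whole neighborhood is foreground."""
    return _morph(mask, min)

def dilate(mask):
    """Set a pixel if any pixel in its neighborhood is foreground."""
    return _morph(mask, max)
```

Applying `dilate(erode(mask))` (a morphological opening) removes isolated noise pixels while restoring the extent of larger blobs, which is one common reason to pair the two filters.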
  • 4
    Self-learning-Computer-Science

    Resources to learn computer science in your spare time

    Self-learning Computer Science is a curated, open-source guide repository designed to help learners independently study computer science topics using high-quality university-level resources. The author (an undergraduate CS student) assembled links to courses from institutions like MIT, UC Berkeley, Stanford, etc., covering mathematics, programming, data structures/algorithms, computer architecture, machine learning, software engineering and more. It’s aimed at learners who find traditional course structures restrictive and want a flexible, self-paced path through CS, with a focus on building depth and breadth rather than shortcut exam skills. The repository provides a roadmap, references, teaching materials, and sometimes the author’s own project examples, offering both guidance and community support. Because the CS field is broad, the structure helps learners allocate study time, avoid duplication, and benefit from “best in class” resources instead of randomly browsing.
    Downloads: 0 This Week
  • 5
    Show Facebook Computer Vision Tags

    Chrome Extension that displays automated image tags from Facebook

    Show Facebook Computer Vision Tags is a Chrome (and Firefox) browser extension created to expose and overlay the automatically generated image tags that Facebook applies to photos in users’ feeds. Since Facebook uses a computer-vision model to analyse user-uploaded images and generate alt-text tags for accessibility (e.g., “Image may contain: golf, grass, outdoor and nature”), this extension surfaces those hidden tags directly in the UI—revealing what kind of information Facebook infers about images (objects present, activities being done, environment). The purpose is educational and somewhat cautionary: to help users understand the scope of visual inference and privacy issues. Once installed, the extension overlays those tags on images in the timeline, making visible what is typically hidden metadata. The project is relatively lightweight but has garnered attention due to its privacy transparency angle.
    Downloads: 0 This Week
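As an illustration of the kind of alt-text the extension surfaces, the sketch below splits a string such as the quoted “Image may contain: golf, grass, outdoor and nature” into individual tags. It is not the extension's code (which runs in the browser); the function name `parse_tags` is hypothetical, and the splitting rule assumes tags are separated by commas or the word "and", so a tag that itself contains "and" would be split incorrectly.

```python
import re

PREFIX = "Image may contain:"

def parse_tags(alt_text):
    """Split an 'Image may contain: ...' alt-text string into a list of tags."""
    if not alt_text.startswith(PREFIX):
        return []
    body = alt_text[len(PREFIX):]
    # Tags are assumed to be separated by commas or by the word "and".
    parts = re.split(r",|\band\b", body)
    return [part.strip() for part in parts if part.strip()]
```

For the example string from the description, this yields the four tags `golf`, `grass`, `outdoor`, and `nature`.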
  • 6
    Computer Vision Application.
    Downloads: 0 This Week
  • 7
    Software that implements existing stereo matching algorithms in computer vision, from the simplest sum of squared differences (SSD) approach to newer algorithms.
    Downloads: 0 This Week
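For readers unfamiliar with SSD stereo matching, a minimal block-matching sketch follows. It is not taken from the project above: it assumes rectified grayscale images stored as nested lists, a square window of side `2 * half + 1`, and the convention that a point at column `x` in the left image appears at column `x - d` in the right image. The name `ssd_disparity` is hypothetical.

```python
def ssd_disparity(left, right, max_disp, half=1):
    """Estimate per-pixel disparity by minimizing the sum of squared differences.

    Border pixels, where the window does not fit, are left at disparity 0.
    """
    rows, cols = len(left), len(left[0])
    disparity = [[0] * cols for _ in range(rows)]
    for r in range(half, rows - half):
        for c in range(half, cols - half):
            best_cost, best_d = None, 0
            # Only try shifts that keep the right-image window inside the image.
            for d in range(0, min(max_disp, c - half) + 1):
                cost = 0
                for dy in range(-half, half + 1):
                    for dx in range(-half, half + 1):
                        diff = left[r + dy][c + dx] - right[r + dy][c + dx - d]
                        cost += diff * diff
                if best_cost is None or cost < best_cost:
                    best_cost, best_d = cost, d
            disparity[r][c] = best_d
    return disparity
```

This brute-force search costs on the order of rows × cols × max_disp × window area, which is why practical implementations usually reuse partial sums or move to more elaborate cost aggregation.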
  • 8
    T81 558

    Applications of Deep Neural Networks

    Deep learning is a group of exciting new technologies for neural networks. Through a combination of advanced training techniques and neural network architectural components, it is now possible to create neural networks that can handle tabular data, images, text, and audio as both input and output. Deep learning allows a neural network to learn hierarchies of information in a way that is like the function of the human brain. This course will introduce the student to classic neural network structures, Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), Generative Adversarial Networks (GAN), and reinforcement learning. Application of these architectures to computer vision, time series, security, natural language processing (NLP), and data generation will be covered. High-Performance Computing (HPC) aspects will demonstrate how deep learning can be leveraged both on graphics processing units (GPUs) and on grids.
    Downloads: 0 This Week
  • 9
    The STAIR Vision Library (SVL), originally developed to support the STanford AI Robot, provides software infrastructure for computer vision, machine learning, and probabilistic graphical models.
    Downloads: 0 This Week
  • 10

    Time Adaptive Self-Organizing Map

    An Artificial Neural Network for Clustering, Classification, etc

    This project aims to provide Time Adaptive Self-Organizing Map (TASOM) implementations for solving Computational Intelligence problems such as Pattern Recognition, Computer Vision, Clustering, Active Contour Modeling, and the like. The TASOM was originally introduced for adaptive and changing environments. Several versions of TASOM networks have been introduced. Some of them are capable of changing the number of neurons based on the problem at hand. Moreover, a binary tree version of the TASOM has been introduced for faster performance.
    Downloads: 0 This Week
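For context, the sketch below shows the training loop of a plain one-dimensional self-organizing map, the model that TASOM builds on. It is deliberately not a TASOM: the learning rate and neighborhood radius here are fixed constants, whereas the description above concerns networks that adapt over time. The function name and the choice to give neighbors half the winner's learning rate are assumptions for illustration.

```python
def train_som(data, weights, epochs=20, learning_rate=0.5, radius=1):
    """Train a 1-D self-organizing map and return the updated weight vectors.

    `data` and `weights` are lists of equal-length numeric vectors.
    """
    weights = [list(w) for w in weights]  # work on a copy
    for _ in range(epochs):
        for sample in data:
            # Best-matching unit: the neuron closest to the sample.
            winner = min(
                range(len(weights)),
                key=lambda i: sum((w - x) ** 2 for w, x in zip(weights[i], sample)),
            )
            # Move the winner and its chain neighbors toward the sample.
            low = max(0, winner - radius)
            high = min(len(weights), winner + radius + 1)
            for i in range(low, high):
                rate = learning_rate if i == winner else learning_rate * 0.5
                weights[i] = [w + rate * (x - w) for w, x in zip(weights[i], sample)]
    return weights
```

Because every update is a convex combination of a weight and a sample, weights that start inside the range of the data stay inside it.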
  • 11
    Tiny

    Tiny Face Detector, CVPR 2017

    This repository implements the Tiny Face Detector (from Hu & Ramanan, CVPR 2017) in MATLAB (using MatConvNet). The method is designed to detect tiny faces (i.e. very small-scale faces) by combining multi-scale context modeling, foveal descriptors, and scale enumeration strategies; the foveal descriptors incorporate context for low-resolution faces. It provides training/testing scripts, a demo (tiny_face_detector.m), model loading, evaluation on WIDER FACE, and supporting utilities (e.g. cnn_widerface_eval.m). The code depends on MatConvNet, which must be compiled (with GPU / CUDA / cuDNN support) for full performance. A pretrained model is provided (ResNet101-based, plus alternatives).
    Downloads: 0 This Week
  • 12
    Toolbox

    Piotr's Image & Video Matlab Toolbox

    Piotr’s Image & Video MATLAB Toolbox is a general-purpose MATLAB toolbox for image and video processing and vision tasks, offering utilities, filters, detection, feature extraction, and algorithm building blocks. It augments MATLAB’s native capabilities (not replacing the Image Processing Toolbox) by providing efficient, reusable wrappers and optimized routines. It includes example and demo scripts (e.g. acfReadme, detector readmes) and supports MEX compilation for speed, along with cross-platform compatibility.
    Downloads: 0 This Week
  • 13
    C++ library that implements basic data structures and algorithms for computer vision and image processing applications
    Downloads: 0 This Week
  • 14
    Travis

    Travis is a small library for computer vision in robots

    Travis contains several algorithms for computer vision in robotics. There are tracking, motion detection and reactive navigation functions.
    Downloads: 0 This Week
  • 15

    UAPD

    UAlbany Pedestrian Detection Project

    UAlbany Pedestrian Detection Project (Computer Vision)
    Downloads: 0 This Week
  • 16
    UoMASM

    A C++ Active Shape Model Library from the University of Manchester

    A library and associated tools for building and searching with Active Shape Models (ASM). It builds on the VXL computer vision libraries. Includes a GUI tool to search new images with the ASM. Depends on: VXL, Qt (Version 4), UoMqVXL
    Downloads: 0 This Week
  • 17
    UoMqVXL

    Qt based GUI classes for the VXL Computer Vision Libraries

    Qt based GUI classes for VXL from the University of Manchester. Includes libraries to display VXL images and graphics, including shape model manipulation. Also includes tools to display images and points, and to annotate images with points.
    Downloads: 0 This Week
  • 18
    VOSM2DExplorer

    VOSM 2D Explorer

    An Eclipse SWT Java GUI based application to demonstrate 2D statistical models in computer vision, for face representation.
    Downloads: 0 This Week
  • 19
    VRN

    Code for "Large Pose 3D Face Reconstruction

    The VRN (Volumetric Regression Network) repository implements the “Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression” method. Instead of explicitly fitting a 3D model via landmark estimation and deformation, VRN treats the reconstruction task as volumetric segmentation: it learns a CNN to regress a 3D volume aligned to the input image, and then extracts a mesh via isosurface from that volume. The network is unguided (no 2D landmarks as intermediate). The mesh surfaces can be textured (in MATLAB branch) and colored. Docker container provided for easy CPU deployment.
    Downloads: 0 This Week
  • 20
    Computer vision workshop project: real-time video processing and simulation of musical instruments.
    Downloads: 0 This Week
  • 21
    Vehícular Traffic Variables Acquisition
    Software for acquiring vehicular traffic variables using the OpenCV computer vision library and Qt.
    Downloads: 0 This Week
  • 22
    ViAmI-Server

    Pattern recognition for ADL events

    This software uses computer vision algorithms to mine sequence data from telemonitoring data with CBRs. We propose an approach that detects changes in behavior, observed through sensor/video fusion and occurring at radically different time scales, using a CBR with two levels: low and high. The system continuously updates the database with the daily data.
    Downloads: 0 This Week
  • 23
    ViKi (Virtual Interactive keyboard Interface) is a global framework that enables contactless human-machine interaction using computer vision techniques. A simple webcam is sufficient to emulate traditional input devices such as a mouse and keyboard.
    Downloads: 0 This Week
  • 24
    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
  • 25
    The Vision Analysis Toolkit provides a framework for the development of computer vision algorithms using arbitrary named channel containers, flexible data types, and 1/2/3-D buffers.
    Downloads: 0 This Week