Showing 29 open source projects for "computer vision"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Kornia

    Kornia

    Open Source Differentiable Computer Vision Library

    ...With Kornia we fill the gap between classical and deep computer vision that implements standard and advanced vision algorithms for AI. Our libraries and initiatives are always according to the community needs.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    Albumentations

    Albumentations

    Fast image augmentation library and an easy-to-use wrapper

    Albumentations is a computer vision tool that boosts the performance of deep convolutional neural networks. Albumentations is a Python library for fast and flexible image augmentations. Albumentations efficiently implements a rich variety of image transform operations that are optimized for performance, and does so while providing a concise, yet powerful image augmentation interface for different computer vision tasks, including object classification, segmentation, and detection. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    MIVisionX

    MIVisionX

    Set of comprehensive computer vision & machine intelligence libraries

    ...AMD OpenVX is a highly optimized open-source implementation of the Khronos OpenVX™ 1.3 computer vision specification. It allows for rapid prototyping as well as fast execution on a wide range of computer hardware, including small embedded x86 CPUs and large workstation discrete GPUs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    ArrayFire

    ArrayFire

    ArrayFire, a general purpose GPU library

    ArrayFire is a general-purpose tensor library that simplifies the process of software development for the parallel architectures found in CPUs, GPUs, and other hardware acceleration devices. The library serves users in every technical computing market. Data structures in ArrayFire are smartly managed to avoid costly memory transfers and to take advantage of each performance feature provided by the underlying hardware. The community of ArrayFire developers invites you to build with us if...
    Downloads: 4 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Deep Learning Essay Reading

    Deep Learning Essay Reading

    Read classic and new deep learning papers paragraph by paragraph

    ...Instead of reading entire dense academic papers, contributors provide structured breakdowns and insights into the most influential research from the past decade, often including explanation highlights and key takeaways. The content spans foundational models, architectures, and training methodologies across computer vision, natural language processing, generative models, and other machine learning domains. These summaries help students, researchers, and engineers stay up to date with breakthroughs in the field without needing to sift through full academic documents. With thousands of stars and forks, this repository has become a widely referenced learning resource for anyone interested in understanding the technical ideas behind major advancements.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    CoreNet

    CoreNet

    CoreNet: A library for training deep neural networks

    ...Its distributed runtime manages synchronization, load balancing, and mixed-precision computation to maximize throughput while minimizing communication bottlenecks. CoreNet integrates tightly with Apple’s proprietary ML stack and hardware, serving as the foundation for research in computer vision, language models, and multimodal systems within Apple AI. The framework includes monitoring tools, fault tolerance mechanisms, and efficient checkpointing for massive training runs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    CO3D (Common Objects in 3D)

    CO3D (Common Objects in 3D)

    Tooling for the Common Objects In 3D dataset

    CO3Dv2 (Common Objects in 3D, version 2) is a large-scale 3D computer vision dataset and toolkit from Facebook Research designed for training and evaluating category-level 3D reconstruction methods using real-world data. It builds upon the original CO3Dv1 dataset, expanding both scale and quality—featuring 2× more sequences and 4× more frames, with improved image fidelity, more accurate segmentation masks, and enhanced annotations for object-centric 3D reconstruction. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    fvcore

    fvcore

    Collection of common code shared among different research projects

    fvcore is a lightweight utility library that factors out common performance-minded components used across Facebook/Meta computer-vision codebases. It provides numerics and loss layers (e.g., focal loss, smooth-L1, IoU/GIoU) implemented for speed and clarity, along with initialization helpers and normalization layers for building PyTorch models. Its common modules include timers, logging, checkpoints, registry patterns, and configuration helpers that reduce boilerplate in research code. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ml-surveys

    ml-surveys

    Survey papers summarizing advances in deep learning, NLP, CV, graphs

    The ml-surveys repository is a broad, maintainable overview of survey papers across many subfields of machine learning — including deep learning, NLP, computer vision, graph ML, reinforcement learning, recommendation systems, embeddings, meta-learning, and more. Instead of diving into code or experiments, this repo gathers authoritative survey and review articles, summarizing the state-of-the-art, trends, challenges, and directions within each subdomain. For someone trying to get up to speed with a new ML subfield — say graph neural networks or meta-learning — ml-surveys offers a curated reading list of foundational and recent works, helping map the landscape quickly. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Darknet

    Darknet

    Convolutional Neural Networks

    ...With GPU acceleration via CUDA and OpenCV integration, it achieves high performance in image recognition tasks. Its simplicity, combined with powerful capabilities, has made Darknet one of the most influential projects in the computer vision community.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 11
    Flux3D.jl

    Flux3D.jl

    3D computer vision library in Julia

    Flux3D.jl is a 3D vision library, written completely in Julia. This package utilizes Flux.jl and Zygote.jl as its building blocks for training 3D vision models and for supporting differentiation. This package also have support of CUDA GPU acceleration with CUDA.jl.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    CameraVoyeur

    CameraVoyeur

    Windows-oriented utility to log frames from a connected camera devices

    ...Because it’s in a regular GitHub repo and not a closed binary, you can inspect the code to see how device selection, frame acquisition, and UI/display are wired. This makes it useful for developers who want to add camera capture to a Windows tool or who need a barebones sample to build surveillance, monitoring, or computer vision toys on top of. Its value is really in being a simple, readable reference rather than a big camera suite.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    YOLO ROS

    YOLO ROS

    YOLO ROS: Real-Time Object Detection for ROS

    This is a ROS package developed for object detection in camera images. You only look once (YOLO) is a state-of-the-art, real-time object detection system. In the following ROS package, you are able to use YOLO (V3) on GPU and CPU. The pre-trained model of the convolutional neural network is able to detect pre-trained classes including the data set from VOC and COCO, or you can also create a network with your own detection objects. The YOLO packages have been tested under ROS Noetic and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CNN for Image Retrieval
    ...It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that can be adapted for custom datasets, making it useful for experimenting with retrieval systems in computer vision. By leveraging CNN architectures, the project showcases how learned embeddings can capture semantic similarity across varied images. This resource serves as both an educational reference and a foundation for further exploration in image retrieval research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    OpenPose

    OpenPose

    Real-time multi-person keypoint detection library for body, face, etc.

    OpenPose has represented the first real-time multi-person system to jointly detect human body, hand, facial, and foot keypoints (in total 135 keypoints) on single images. It is authored by Ginés Hidalgo, Zhe Cao, Tomas Simon, Shih-En Wei, Yaadhav Raaj, Hanbyul Joo, and Yaser Sheikh. It is maintained by Ginés Hidalgo and Yaadhav Raaj. OpenPose would not be possible without the CMU Panoptic Studio dataset. We would also like to thank all the people who has helped OpenPose in any way. 15, 18 or...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 16
    Top Deep Learning Projects

    Top Deep Learning Projects

    A list of popular github projects related to deep learning

    ...Rather than being a library itself, it serves as a curated roadmap and reference guide for anyone exploring the deep learning ecosystem — from beginners to experienced practitioners. By aggregating high-star projects across frameworks (TensorFlow, PyTorch), tools (computer vision, NLP, reinforcement learning), tutorials, and research code, it helps users quickly discover reputable and well-maintained repositories. This way one can survey state-of-the-art projects, find learning resources, or pick stable libraries for production — without manually sifting through hundreds of repos. The repository is openly licensed under MIT, making it easy to fork, extend, or contribute updates (e.g. adding newer projects or reordering by recent popularity).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    DeepLearning

    DeepLearning

    Deep Learning (Flower Book) mathematical derivation

    ...At the same time, it also introduces deep learning techniques used by practitioners in the industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling and practical methods, and investigates topics such as natural language processing, Applications in speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the Deep Learning book provides research directions covering theoretical topics including linear factor models, autoencoders, representation learning, structured probabilistic models, etc.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    Replica Dataset

    Replica Dataset

    High-fidelity indoor 3D dataset for AI simulation and robotics

    Replica Dataset is a high-quality 3D dataset of realistic indoor environments designed to advance research in computer vision, robotics, and embodied AI. Developed by Facebook Research (now Meta AI), it features accurate geometric reconstructions, high-resolution and high dynamic range textures, and comprehensive semantic annotations. Each environment contains detailed models of real-world spaces, including rooms, furniture, glass, and mirror surfaces.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 19
    SFD

    SFD

    S³FD: Single Shot Scale-invariant Face Detector, ICCV, 2017

    ...It includes training scripts, evaluation code, and pre-trained models that achieve strong results on popular benchmarks such as AFW, PASCAL Face, FDDB, and WIDER FACE. The framework is optimized for speed and accuracy, making it suitable for both academic research and practical applications in computer vision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    VideoMan Library

    VideoMan Library

    C++ library for image acquisition and visualization

    Library for capturing video from cameras, 3d sensors, frame-grabbers, video files and image sequences. It can also display multiple images using OpenGL with different layouts. Easy integration with OpenCV, CUDA... Perfect for computer vision. Keywords: video capture, computer vision, machine vision, opencv, opengl, cameras, video input devices, firewire, usb, gige
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Accord.NET Framework

    Accord.NET Framework

    Scientific computing, machine learning and computer vision for .NET

    The Accord.NET Framework provides machine learning, mathematics, statistics, computer vision, computer audition, and several scientific computing related methods and techniques to .NET. The project is compatible with the .NET Framework. NET Standard, .NET Core, and Mono.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    Face Verification Experiment

    Face Verification Experiment

    Original Caffe Version for LightCNN-9. Highly recommend to use PyTorch

    ...The repository serves as a resource for understanding the application of convolutional neural networks to identity verification, highlighting both methodology and results. It is primarily intended for academic and research purposes in computer vision and biometrics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    tracking.js

    tracking.js

    A modern approach for Computer Vision on the web

    ...Test out the web server by loading the finished version of the project. The main goal of tracking.js is to provide those complex techniques in a simple and intuitive way on the web. We believe computer vision is important to improve people's life, bringing it to the web will make this future a reality a lot faster.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    node-opencv

    node-opencv

    OpenCV Bindings for node.js

    OpenCV bindings for Node.js. OpenCV is the defacto computer vision library - by interfacing with it natively in node, we get powerful real time vision in js. People are using node-opencv to fly control quadrocoptors, detect faces from webcam images and annotate video streams. If you're using it for something cool, I'd love to hear about it! You'll need OpenCV 2.3.1 or newer installed before installing node-opencv.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    Awesome Recurrent Neural Networks

    Awesome Recurrent Neural Networks

    A curated list of resources dedicated to RNN

    ...Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (Tensorflow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn in tensorflow, and much more. Codes, theory, applications, and datasets about natural language processing, robotics, computer vision, and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB