Open Source Linux Computer Vision Libraries - Page 2

Computer Vision Libraries for Linux

1

torchvision

Datasets, transforms and models specific to Computer Vision

The torchvision package consists of popular datasets, model architectures, and common image transformations for computer vision. The project recommends Anaconda as the Python package management system. Torchvision currently supports the following image backends: Pillow (default); Pillow-SIMD, a much faster drop-in replacement for Pillow with SIMD, which is used as the default if installed; and accimage, which, if installed, can be activated by calling torchvision.set_image_backend('accimage'). For image I/O it relies on libpng, which can be installed via conda (conda install libpng) or any of the package managers for Debian-based and RHEL-based Linux distributions, and on libjpeg, which can be installed via conda (conda install jpeg) or through the same package managers. libjpeg-turbo is supported as well. libpng and libjpeg must be present at compilation time in order to be available. TorchVision also offers a C++ API that contains C++ equivalents of the Python models.
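As a rough illustration of the backend switch described above, the following sketch picks accimage when that package is importable and otherwise falls back to Pillow. The helper names pick_image_backend and apply_backend, and the available parameter, are hypothetical and exist only for this example; torchvision.set_image_backend and torchvision.get_image_backend are the calls the library provides.

```python
import importlib.util


def pick_image_backend(available=None):
    """Return 'accimage' when that package is importable, else 'PIL'.

    `available` is a hypothetical override (a set of module names) that
    makes the helper easy to exercise without installing anything.
    """
    if available is None:
        has_accimage = importlib.util.find_spec("accimage") is not None
    else:
        has_accimage = "accimage" in available
    return "accimage" if has_accimage else "PIL"


def apply_backend():
    """Configure torchvision's image backend, if torchvision is installed."""
    backend = pick_image_backend()
    if importlib.util.find_spec("torchvision") is None:
        return None  # torchvision is not installed; nothing to configure
    import torchvision

    torchvision.set_image_backend(backend)
    return torchvision.get_image_backend()
```

Calling apply_backend() once at program start, before any dataset is loaded, should be enough for this purpose. It returns None on a machine without torchvision.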

Downloads: 3 This Week

Last Update: 2026-03-10
See Project
2

Mobile Robot Programming Toolkit (MRPT)

**MOVED TO GITHUB** ==> https://github.com/MRPT/mrpt

The Mobile Robot Programming Toolkit (MRPT) is an extensive, cross-platform, open source C++ library that helps robotics researchers design and implement algorithms for localization, SLAM, navigation, and computer vision. http://www.mrpt.org/

2 Reviews

Downloads: 25 This Week

Last Update: 2015-02-02
See Project
3

BoofCV

BoofCV is an open source Java library for real-time computer vision.

BoofCV is an open source Java library for real-time computer vision and robotics applications. Written from scratch for ease of use and high performance, it provides both the basic and the advanced features needed to create a computer vision system. Functionality ranges from optimized low-level image processing routines (e.g. convolution, interpolation, gradient) to high-level functionality such as image stabilization. It is released under an Apache 2.0 license for both academic and commercial use.

2 Reviews

Downloads: 24 This Week

Last Update: 2025-03-22
See Project
4

DeepLearning

Deep Learning (Flower Book) mathematical derivation

"Deep Learning" is a comprehensive book in the field of deep learning, also known as the Deep Learning AI Bible. It is written by three world-renowned experts: Ian Goodfellow, Yoshua Bengio, and Aaron Courville. It covers linear algebra, probability theory, information theory, numerical optimization, and related content in machine learning. It also introduces deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methods, and it surveys applications such as natural language processing, speech recognition, computer vision, online recommender systems, bioinformatics, and video games. Finally, the book presents research directions covering theoretical topics including linear factor models, autoencoders, representation learning, and structured probabilistic models.

Downloads: 2 This Week

Last Update: 2022-08-02
See Project
5

Diffgram

Training data (data labeling, annotation, workflow) for all data types

Diffgram is a single application that covers ingesting data, exploring it, annotating it, and managing workflows, bringing all aspects of training data under one roof. Diffgram describes itself as the world's first truly open source training data platform, with a focus on giving users an unlimited experience. It aims to reduce data labeling bills and increase training data quality. Training data is the art of supervising machines through data. This includes annotation, which produces structured data ready to be consumed by a machine learning model. Annotation is required because raw media is unstructured and not usable without it. Training data is therefore required for many modern machine learning use cases, including computer vision, natural language processing, and speech recognition.

Downloads: 2 This Week

Last Update: 2024-10-14
See Project
6

Screenshot to Code

A neural network that transforms a design mock-up into static websites

Screenshot-to-code is a tool or prototype that attempts to convert UI screenshots (e.g., of mobile or web UIs) into code representations, likely generating layouts, HTML, CSS, or markup from image inputs. It is part of a research/proof-of-concept domain in UI automation and image-to-UI code generation. Features include mapping visual design to code constructs, output as code or UI layout (HTML, CSS, or markup), and examples and demo scripts showing the image-to-UI-code process.

Downloads: 2 This Week

Last Update: 2025-09-26
See Project
7

Scene

Scene is a computer vision framework that performs background subtraction and object tracking, using two traditional algorithms and three more recent algorithms based on neural networks and fuzzy classification rules. For each detected object, Scene sends TUIO messages to one or several client applications. The present release features GPU-accelerated versions of all the background subtraction methods, as well as morphological post-processing of the object blobs with dilation and erosion filters, implemented in OpenCL. The framework was mainly designed as a toolkit for the rapid development of interactive art projects that explore the dynamics of complex environments. The Scene GUI compiles and runs under Windows, Linux, and MacOS X, and is available in both 32-bit and 64-bit versions.
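To make the pipeline of background subtraction followed by erosion and dilation more concrete, here is a minimal CPU sketch in plain Python. It uses simple thresholded frame differencing and a 3x3 neighbourhood, which are assumptions made for illustration; it is not one of Scene's five algorithms and not its OpenCL implementation, and all function names are hypothetical.

```python
def subtract_background(frame, background, threshold):
    """Return a binary mask: 1 where |frame - background| > threshold."""
    return [
        [1 if abs(f - b) > threshold else 0 for f, b in zip(frow, brow)]
        for frow, brow in zip(frame, background)
    ]


def _morph(mask, combine):
    """Apply `combine` over each 3x3 neighbourhood, ignoring out-of-bounds pixels."""
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                mask[ny][nx]
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
            ]
            out[y][x] = combine(window)
    return out


def erode(mask):
    """A pixel survives only if its whole neighbourhood is foreground."""
    return _morph(mask, min)


def dilate(mask):
    """A pixel becomes foreground if any neighbour is foreground."""
    return _morph(mask, max)


# A 6x6 scene: a 3x3 object plus one isolated noisy pixel at (5, 5).
background = [[0] * 6 for _ in range(6)]
frame = [[0] * 6 for _ in range(6)]
for y in range(1, 4):
    for x in range(1, 4):
        frame[y][x] = 200
frame[5][5] = 200

mask = subtract_background(frame, background, threshold=50)
cleaned = dilate(erode(mask))  # erosion then dilation removes the speck
```

Erosion followed by dilation (a morphological opening) drops blobs smaller than the neighbourhood while restoring the shape of larger ones, which is why the isolated pixel disappears and the 3x3 object remains.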

3 Reviews

Downloads: 12 This Week

Last Update: 2016-03-14
See Project
8

ArrayFire

ArrayFire, a general purpose GPU library

ArrayFire is a general-purpose tensor library that simplifies software development for the parallel architectures found in CPUs, GPUs, and other hardware acceleration devices. The library serves users in every technical computing market. Data structures in ArrayFire are managed to avoid costly memory transfers and to take advantage of each performance feature provided by the underlying hardware. The community of ArrayFire developers invites contributors who are interested in and able to write top-performing tensor functions, under a Code of Conduct that promotes a respectful and friendly building experience. Features include rigorous benchmarks and tests to ensure top performance and numerical accuracy; cross-platform compatibility with support for CUDA, OpenCL, and native CPU on Windows, Mac, and Linux; and built-in visualization functions through Forge.

Downloads: 1 This Week

Last Update: 2025-09-05
See Project
9

Colossal-AI

Making large AI models cheaper, faster and more accessible

The Transformer architecture has improved the performance of deep learning models in domains such as computer vision and natural language processing. Better performance comes with larger model sizes, which run up against the memory wall of current accelerator hardware such as GPUs. Training large models such as Vision Transformer, BERT, and GPT on a single GPU or a single machine is rarely practical, so there is an urgent demand to train models in a distributed environment. However, distributed training, especially model parallelism, often requires domain expertise in computer systems and architecture, and it remains a challenge for AI researchers to implement complex distributed training solutions for their models. Colossal-AI provides a collection of parallel components. It aims to let you write distributed deep learning models the same way you would write a model on your laptop.

Downloads: 1 This Week

Last Update: 2025-05-28
See Project
10

DnCNN

Beyond a Gaussian Denoiser: Residual Learning of Deep CNN

This repository implements DnCNN (“Deep CNN Denoiser”) from the paper “Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising”. DnCNN is a feedforward convolutional neural network that learns to predict the residual noise (i.e. the noise map) from a noisy input image, which is then subtracted from the input to yield a clean image. This formulation allows efficient denoising, supports blind Gaussian denoising (i.e. unknown noise levels), and can be extended in some variants to related tasks such as image super-resolution or JPEG deblocking. The repository includes training code (using MatConvNet / MATLAB), demo scripts, pretrained models, and evaluation routines. A single model can handle multiple noise levels.
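The residual formulation described above can be sketched in a few lines: the model predicts the noise map, and the clean estimate is the noisy input minus that prediction. In this sketch the function denoise and the stand-in predictor oracle are hypothetical; the oracle simply returns the known noise and takes the place of the trained CNN, which is not reproduced here.

```python
def denoise(noisy, predict_residual):
    """Residual formulation: clean estimate = noisy image - predicted noise map."""
    residual = predict_residual(noisy)
    return [
        [n - r for n, r in zip(nrow, rrow)]
        for nrow, rrow in zip(noisy, residual)
    ]


# Toy data: a clean image, a noise map, and their sum (the noisy observation).
clean = [[10, 10], [20, 20]]
noise = [[1, -2], [0, 3]]
noisy = [
    [c + v for c, v in zip(crow, vrow)]
    for crow, vrow in zip(clean, noise)
]


def oracle(image):
    """Stand-in for the trained network: returns the true noise map."""
    return noise


restored = denoise(noisy, oracle)
```

With a perfect residual prediction the subtraction recovers the clean image exactly. A trained network would only approximate the noise map, so its output would approximate the clean image.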

Downloads: 1 This Week

Last Update: 2025-09-29
See Project
11

Overhead Imagery Research Data Set

Overhead Imagery Research Data Set (OIRDS) - an annotated data library & tools to aid in the development of computer vision algorithms

Downloads: 19 This Week

Last Update: 2018-04-09
See Project
12

OpenNN - Open Neural Networks Library

Machine learning algorithms for advanced analytics

OpenNN is a software library written in C++ for advanced analytics. It implements neural networks, which the project describes as the most successful machine learning method. Some typical applications of OpenNN are business intelligence (customer segmentation, churn prevention…), health care (early diagnosis, microarray analysis…) and engineering (performance optimization, predictive maintenance…). OpenNN does not deal with computer vision or natural language processing. The main advantage of OpenNN is its high performance: the library stands out in terms of execution speed and memory allocation, and it is constantly optimized and parallelized to maximize its efficiency. The documentation consists of tutorials and examples that give a complete overview of the library. OpenNN is developed by Artelnics, a company specialized in artificial intelligence.

8 Reviews

Downloads: 3 This Week

Last Update: 2026-04-07
See Project
13

The VXL Project

A multi-platform collection of C++ software libraries for Computer Vision and Image Understanding.

5 Reviews

Downloads: 2 This Week

Last Update: 2024-10-18
See Project
14

Integrating Vision Toolkit

The Integrating Vision Toolkit (IVT) is a powerful and fast C++ computer vision library with an easy-to-use object-oriented architecture. It offers its own multi-platform GUI toolkit. OpenCV is integrated optionally. Website: http://ivt.sourceforge.net

2 Reviews

Downloads: 3 This Week

Last Update: 2019-11-28
See Project
15

SITPLUS

SITPLUS is a free software framework whose main goal is to provide recreational activities for people with multiple disabilities. It offers new forms of interaction based on computer vision, voice and other peripherals.

1 Review

Downloads: 4 This Week

Last Update: 2014-08-25
See Project
16

OpenPR

OpenPR stands for Open Pattern Recognition project and is intended to be an open source library for algorithms of image processing, computer vision, natural language processing, pattern recognition, machine learning and the related fields.

Downloads: 2 This Week

Last Update: 2018-05-15
See Project
17

horus

Horus is a toolkit for developing applications that include intelligent agents. It provides algorithms for computer vision; image processing; mapping, exploration, and navigation of unknown environments; and management of intelligent agents.

Downloads: 2 This Week

Last Update: 2013-04-08
See Project
18

Accord.NET Framework

Scientific computing, machine learning and computer vision for .NET

The Accord.NET Framework provides machine learning, mathematics, statistics, computer vision, computer audition, and several scientific computing related methods and techniques to .NET. The project is compatible with the .NET Framework, .NET Standard, .NET Core, and Mono.

1 Review

Downloads: 1 This Week

Last Update: 2017-10-19
See Project
19

Blazeface

Blazeface is a lightweight model that detects faces in images

Blazeface is a lightweight, high-performance face detection model designed for mobile and embedded devices, developed by TensorFlow. It is optimized for real-time face detection tasks and runs efficiently on mobile CPUs, ensuring minimal latency and power consumption. Blazeface is based on a fast architecture and uses deep learning techniques to detect faces with high accuracy, even in challenging conditions. It supports multiple face detection in varying lighting and poses, and is designed to work in real-world applications like mobile apps, robotics, and other resource-constrained environments.

Downloads: 1 This Week

Last Update: 2025-03-19
See Project
20

ImageNets

ImageNets is a computer vision and image processing extension to the OpenCV library for user-friendly programming of robot vision algorithms. ImageNets uses Qt by Nokia. ImageNets is ideal for teaching image processing.

Downloads: 1 This Week

Last Update: 2012-09-28
See Project
21

Open Cezeri Library

Effective Linear Algebra and Computer Vision Library with JAVA

OCL stands for Open Cezeri Library (yet another linear algebra and matrix library). The library aims for rapid coding with MATLAB-like ease of use. To learn the library, try the test examples at OpenCezeriLibrary\test\test. It was originally developed at the el-cezeri laboratory of Siirt University to establish a generic framework of reusable components and software tools for machine vision, machine learning, AI, and robotic applications. Currently, it covers the following main concepts:
1- Vision: It can access webcams and Imaging Source industrial cameras, with manual settings and advanced options. Work on accessing Leap Motion and Kinect is still under development.
2- Machine learning: It uses the Weka software tool and some custom-coded ML algorithms.
3- CMatrix: A special matrix library called CMatrix, short for Cezeri Matrix Class. It is regarded as the core of OCL. CMatrix supports a fluent interface and method chaining.
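For readers unfamiliar with the fluent interface and method chaining mentioned for CMatrix, the idea can be sketched with a tiny matrix class in which every operation returns a new matrix, so calls can be strung together. This is an illustration in Python under assumed names (Matrix, transpose, scale, add, to_list); it is not the CMatrix API, which is a Java class and is not reproduced here.

```python
class Matrix:
    """Hypothetical minimal matrix type whose methods return new matrices."""

    def __init__(self, rows):
        self.rows = [list(r) for r in rows]

    def transpose(self):
        """Swap rows and columns."""
        return Matrix(zip(*self.rows))

    def scale(self, k):
        """Multiply every element by the scalar k."""
        return Matrix([[k * v for v in row] for row in self.rows])

    def add(self, other):
        """Element-wise sum with another matrix of the same shape."""
        return Matrix(
            [[a + b for a, b in zip(ra, rb)]
             for ra, rb in zip(self.rows, other.rows)]
        )

    def to_list(self):
        """Return a plain nested-list copy of the data."""
        return [list(r) for r in self.rows]


ones = Matrix([[1, 1], [1, 1]])
# Each call returns a Matrix, so the steps read left to right as one chain.
result = Matrix([[1, 2], [3, 4]]).transpose().scale(2).add(ones).to_list()
```

Because each method returns a fresh object rather than modifying its receiver, intermediate matrices are never changed by later steps in the chain.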

Downloads: 1 This Week

Last Update: 2016-04-11
See Project
22

OpenImage

Java image viewer, an open source java learning application. Currently in alpha stage.

Downloads: 1 This Week

Last Update: 2014-06-26
See Project
23

QVision: Computer Vision Library for Qt

Computer vision and image processing library for Qt.

This library contains, among other things, a set of graphical widgets for video output, performance evaluation, and augmented reality. The library also provides classes for several data types usually required by computer vision and image processing applications, such as vectors, matrices, quaternions, and images. Thanks to a large number of wrapper functions, these objects can be used with highly efficient functionality from third-party libraries such as OpenCV, the GNU Scientific Library, the Computational Geometry Algorithms Library, Intel's Math Kernel Library and Integrated Performance Primitives, and the Octave library.

Downloads: 1 This Week

Last Update: 2013-07-02
See Project
24

TSPS

TSPS is an open toolkit for sensing people in spaces.

The Toolkit for Sensing People in Spaces (TSPS) is an open-source tool for creating interactive applications based on natural user interaction. It wraps computer vision algorithms into a simple, easy-to-use interface. TSPS was developed in openFrameworks for use in design, prototyping, and installed systems. We use TSPS for retail, fine arts installations, rapid prototyping, educational workshops, and provide the source openly for use in production. It builds on the backs of giants and our hope is that we can contribute ways to make it easier for beginners to explore computer vision while also providing a framework for experts to build off of.

Downloads: 1 This Week

Last Update: 2016-02-11
See Project
25

ViPER

The Video Processing Evaluation Resource: A toolkit for evaluating computer vision algorithms on video, and a corresponding tool for annotating video streams with spatial metadata.

Downloads: 1 This Week

Last Update: 2013-04-08
See Project