Page 3 | Best Open Source Object Detection Models 2026

Object Detection Models


1

ObjectDetector

Car Detection, Face Detection, Object Detection

Machine learning: This project trains detectors for new object classes such as cars and motorcycles, then uses the resulting model (an XML file) to detect those objects in images. The project uses the Viola-Jones algorithm.

1 Review

Downloads: 0 This Week

Last Update: 2015-08-22
See Project
2

Objectron

A dataset of short, object-centric video clips

The Objectron dataset is a collection of short, object-centric video clips, which are accompanied by AR session metadata that includes camera poses, sparse point-clouds and characterization of the planar surfaces in the surrounding environment. In each video, the camera moves around the object, capturing it from different angles. The data also contain manually annotated 3D bounding boxes for each object, which describe the object’s position, orientation, and dimensions. The dataset consists of 15K annotated video clips supplemented with over 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes. In addition, to ensure geo-diversity, our dataset is collected from 10 countries across five continents. Along with the dataset, we are also sharing a 3D object detection solution for four categories of objects — shoes, chairs, mugs, and cameras.
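As a rough illustration of how a 3D bounding box given by position, orientation, and dimensions determines its eight corners, here is a minimal Python sketch. The function name `box_corners` and the choice of a 3x3 rotation matrix as the orientation format are assumptions made for this example; they are not taken from the Objectron code or its annotation format.

```python
from itertools import product


def box_corners(center, rotation, size):
    """Return the 8 corners of an oriented 3D box (hypothetical helper).

    center:   (x, y, z) position of the box center
    rotation: 3x3 rotation matrix as nested lists (assumed orientation format)
    size:     (width, height, depth) full extents along the box's own axes
    """
    half = [s / 2 for s in size]
    corners = []
    # Enumerate every sign combination (-1/+1) along the three local axes.
    for signs in product((-1, 1), repeat=3):
        local = [sign * h for sign, h in zip(signs, half)]
        # Rotate the local corner, then translate it by the box center.
        world = tuple(
            sum(rotation[r][c] * local[c] for c in range(3)) + center[r]
            for r in range(3)
        )
        corners.append(world)
    return corners


identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
corners = box_corners((1, 2, 3), identity, (2, 4, 6))
```

With the identity rotation, the corners are simply the center plus or minus half of each dimension; a non-identity rotation turns the box about its center before translation.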

Downloads: 0 This Week

Last Update: 2022-02-21
See Project
3

Paper2GUI

Convert AI papers to GUI

Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology. Paper2GUI: An AI desktop app toolbox for ordinary people. It can be used immediately without installation. It already supports 40+ AI models, covering AI painting, speech synthesis, video frame interpolation, video super-resolution, object detection, image stylization, OCR, and other fields. Supports Windows, Mac, and Linux systems.

Downloads: 0 This Week

Last Update: 2024-09-20
See Project
4

PyTorch Transfer-Learning-Library

Transfer Learning Library for Domain Adaptation, Task Adaptation, etc.

TLlib is an open-source and well-documented library for Transfer Learning. It is based on pure PyTorch with high performance and friendly API. Our code is pythonic, and the design is consistent with torchvision. You can easily develop new algorithms or readily apply existing algorithms. We appreciate all contributions. If you are planning to contribute back bug-fixes, please do so without any further discussion. If you plan to contribute new features, utility functions or extensions, please first open an issue and discuss the feature with us.

Downloads: 0 This Week

Last Update: 2022-08-10
See Project
5

PyTracking

Visual tracking library based on PyTorch

A general python framework for visual object tracking and video object segmentation, based on PyTorch. Official implementation of the RTS (ECCV 2022), ToMP (CVPR 2022), KeepTrack (ICCV 2021), LWL (ECCV 2020), KYS (ECCV 2020), PrDiMP (CVPR 2020), DiMP (ICCV 2019), and ATOM (CVPR 2019) trackers, including complete training code and trained models.

Downloads: 0 This Week

Last Update: 2023-08-14
See Project
6

Pytorch-toolbelt

PyTorch extensions for fast R&D prototyping and Kaggle farming

pytorch-toolbelt is a Python library with a set of bells and whistles for PyTorch for fast R&D prototyping and Kaggle farming. Easy model building using flexible encoder-decoder architecture. Modules: CoordConv, SCSE, Hypercolumn, Depthwise separable convolution and more. GPU-friendly test-time augmentation (TTA) for segmentation and classification. GPU-friendly inference on huge (5000x5000) images. Everyday common routines (fix/restore random seed, filesystem utils, metrics). Losses: BinaryFocalLoss, Focal, ReducedFocal, Lovasz, Jaccard and Dice losses, Wing Loss and more. Extras for Catalyst library (visualization of batch predictions, additional metrics). By design, both the encoder and the decoder produce a list of tensors, from fine (high-resolution, indexed 0) to coarse (low-resolution) feature maps. Access to all intermediate feature maps is beneficial if you want to apply deep supervision losses to them or build an encoder-decoder for an object detection task.

Downloads: 0 This Week

Last Update: 2024-11-21
See Project
7

Raster Vision

Open source framework for deep learning satellite and aerial imagery

Raster Vision is an open source framework for Python developers building computer vision models on satellite, aerial, and other large imagery sets (including oblique drone imagery). There is built-in support for chip classification, object detection, and semantic segmentation using PyTorch. Raster Vision allows engineers to quickly and repeatably configure pipelines that go through core components of a machine learning workflow: analyzing training data, creating training chips, training models, creating predictions, evaluating models, and bundling the model files and configuration for easy deployment. The input to a Raster Vision pipeline is a set of images and training data, optionally with Areas of Interest (AOIs) that describe where the images are labeled. The output of a Raster Vision pipeline is a model bundle that allows you to easily utilize models in various deployment scenarios.

Downloads: 0 This Week

Last Update: 2024-08-30
See Project
8

SAHI

A lightweight vision library for performing large-scale object detection

A lightweight vision library for performing large-scale object detection and instance segmentation. Object detection and instance segmentation are by far the most important fields of applications in Computer Vision. However, detecting small objects and running inference on large images remain major issues in practice. SAHI helps developers overcome these real-world problems with many vision utilities. Detection of small objects and objects far away in the scene is a major challenge in surveillance applications. Such objects are represented by a small number of pixels in the image and lack sufficient detail, making them difficult to detect with conventional detectors. In this work, an open-source framework called Slicing Aided Hyper Inference (SAHI) is proposed that provides a generic slicing-aided inference and fine-tuning pipeline for small object detection.
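The slicing idea can be sketched in a few lines of plain Python: tile a large image into overlapping windows, run a detector on each window, then shift each detection back into full-image coordinates. This is a minimal illustration only, not SAHI's API; the names `slice_boxes` and `shift_to_full_image` and the pixel-based `overlap_px` parameter are hypothetical, and the detector itself plus the final merging of duplicate detections are left out.

```python
def slice_boxes(image_w, image_h, slice_w, slice_h, overlap_px=0):
    """Return (x_min, y_min, x_max, y_max) windows covering the image.

    Neighbouring windows overlap by about overlap_px pixels; windows that
    would run past the image border are shifted back inside it.
    """
    step_x = slice_w - overlap_px
    step_y = slice_h - overlap_px
    if step_x <= 0 or step_y <= 0:
        raise ValueError("overlap_px must be smaller than the slice size")

    boxes = []
    y = 0
    while True:
        y_max = min(y + slice_h, image_h)
        y_min = max(0, y_max - slice_h)
        x = 0
        while True:
            x_max = min(x + slice_w, image_w)
            x_min = max(0, x_max - slice_w)
            boxes.append((x_min, y_min, x_max, y_max))
            if x_max >= image_w:
                break
            x += step_x
        if y_max >= image_h:
            break
        y += step_y
    return boxes


def shift_to_full_image(detection, slice_box):
    """Map a box from slice coordinates to full-image coordinates."""
    x0, y0 = slice_box[0], slice_box[1]
    return (detection[0] + x0, detection[1] + y0,
            detection[2] + x0, detection[3] + y0)


windows = slice_boxes(1000, 500, 500, 500, overlap_px=100)
```

The overlap matters because an object cut in half by a window border may be missed in both windows; with overlap, it is more likely to appear whole in at least one of them.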

Downloads: 0 This Week

Last Update: 2025-09-28
See Project
9

SOD

An Embedded Computer Vision & Machine Learning Library

SOD is an embedded, modern cross-platform computer vision and machine learning software library that exposes a set of APIs for deep learning and advanced media analysis and processing, including real-time, multi-class object detection and model training on embedded systems with limited computational resources and on IoT devices. SOD was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in open source as well as commercial products. SOD implements state-of-the-art computer vision algorithms found to be mandatory in real-world application areas. It also provides the Sobel operator, Otsu's binarization, and over 100 image/frame processing and analysis interfaces. It is designed for computational efficiency, with a strong focus on real-time applications. SOD includes a comprehensive set of both classic and state-of-the-art deep neural networks with their pre-trained models.

Downloads: 0 This Week

Last Update: 2023-07-26
See Project
10

Scalable Multimodal Object Recognition

This is an object recognition library written on top of OpenCV. Scalable Multimodal Object Recognition (SMORs) is designed for real-time, highly accurate object detection.

Downloads: 0 This Week

Last Update: 2015-04-24
See Project
11

TNN

Uniform deep learning inference framework for mobile

TNN is a high-performance, lightweight neural network inference framework open sourced by Tencent Youtu Lab. It also has many outstanding advantages such as cross-platform support, high performance, model compression, and code tailoring. The TNN framework further strengthens support and performance optimization for mobile devices on the basis of the original Rapidnet and ncnn frameworks. At the same time, it draws on the high performance and good scalability of the industry's mainstream open source frameworks, and expands support for X86 and NV GPUs. On mobile phones, TNN has been used by many applications such as mobile QQ, weishi, and Pitu. As a basic acceleration framework for Tencent Cloud AI, TNN has provided acceleration support for the implementation of many businesses. Everyone is welcome to participate in the collaborative construction to promote the further improvement of the TNN inference framework.

Downloads: 0 This Week

Last Update: 2022-08-03
See Project
12

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and compare the results. Tensor2Tensor, or T2T for short, is a library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. T2T was developed by researchers and engineers in the Google Brain team and a community of users. It is now deprecated; we keep it running and welcome bug-fixes, but encourage users to use the successor library Trax.

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
13

TensorFlow Object Counting API

The TensorFlow Object Counting API is an open source framework

The TensorFlow Object Counting API is an open source framework built on top of TensorFlow and Keras that makes it easy to develop object counting systems. Please contact the author if you need a professional object detection, tracking, and counting project with very high accuracy and reliability. You can train TensorFlow models with your own training data to build your own custom object counter system. To learn how, check one of the sample projects, which cover some of the theory of transfer learning and show how to apply it in useful projects. Development is in progress: an updated, more capable and lightweight API will be available in this repo soon, along with detailed API documentation and sample Jupyter notebooks that explain basic usage of the API.

Downloads: 0 This Week

Last Update: 2022-08-19
See Project
14

TensorNets

High level network definitions with pre-trained weights in TensorFlow

High level network definitions with pre-trained weights in TensorFlow (tested with 2.1.0 >= TF >= 1.4.0). Applicability. Many people already have their own ML workflows and want to put a new model on their workflows. TensorNets can be easily plugged together because it is designed as simple functional interfaces without custom classes. Manageability. Models are written in tf.contrib.layers, which is lightweight like PyTorch and Keras, and allows for ease of accessibility to every weight and end-point. Also, it is easy to deploy and expand a collection of pre-processing and pre-trained weights. Readability. With recent TensorFlow APIs, more factoring and less indenting can be possible. For example, all the inception variants are implemented as about 500 lines of code in TensorNets while 2000+ lines in official TensorFlow models. Reproducibility. You can always reproduce the original results with simple APIs including feature extractions.

Downloads: 0 This Week

Last Update: 2022-08-23
See Project
15

Turi Create

Simplifies the development of custom machine learning models

Turi Create simplifies the development of custom machine learning models. You don't have to be a machine learning expert to add recommendations, object detection, image classification, image similarity or activity classification to your app. If you want your app to recognize specific objects in images, you can build your own model with just a few lines of code. Turi Create supports macOS 10.12+, Linux (with glibc 2.10+), and Windows 10 (via WSL). Turi Create requires Python 2.7, 3.5, 3.6, 3.7, or 3.8, the x86_64 architecture, and at least 4 GB of RAM. We recommend using virtualenv to use, install, or build Turi Create. The package User Guide and API Docs contain more details on how to use Turi Create. If you want to build Turi Create from source, see BUILD.md. Turi Create does not require a GPU, but certain models can be accelerated 9-13x by utilizing a GPU.

Downloads: 0 This Week

Last Update: 2021-06-02
See Project
16

UniVL

Official implementation for UniVL video and language training models

UniVL is a video-language pre-training model. It is designed with four modules and five objectives for both video-language understanding and generation tasks. It is also a flexible model for most multimodal downstream tasks, considering both efficiency and effectiveness.

Downloads: 0 This Week

Last Update: 2024-07-12
See Project
17

VOC-DPM

Object detection system using deformable part models (DPMs)

The VOC-DPM repository is an implementation of an object detection system built on deformable part models (DPMs) and latent SVM learning, specifically packaged as “voc-release5.” It is the companion code for Ross Girshick’s dissertation, and extends earlier work on discriminatively trained DPMs. The system supports a grammar-based representation for object models, allowing structures such as mixtures and hierarchies to represent parts and whole objects. It implements both latent SVM training (where part assignments are treated as latent variables) and weak-label structural SVM (WL-SSVM) for learning from partially labeled data. The code integrates several enhancements: a star-cascade detection algorithm to speed up screening, context rescoring (re-ranking detections using contextual information), and various optimizations like in-memory training (rather than large on-disk data files).

Downloads: 0 This Week

Last Update: 2025-09-12
See Project
18

VoTT

Visual Object Tagging Tool, an electron app for building models

Visual Object Tagging Tool: An Electron app for building end-to-end object detection models from images and videos. An open source annotation and labeling tool for image and video assets. VoTT is a React + Redux Web application, written in TypeScript. This project was bootstrapped with Create React App. VoTT can be installed as a native application or run from source. VoTT is also available as a stand-alone Web application and can be used in any modern Web browser. VoTT is available for Windows, Linux and OSX. Download the appropriate platform package/installer from GitHub Releases. The Web version of VoTT cannot access the local file system; all assets must be imported/exported through a Cloud project. VoTT V2 is a refactor and refresh of the original Electron-based application. As the usage and demand for VoTT grew, V2 was started as an initiative to improve VoTT and make it more extensible and maintainable.

1 Review

Downloads: 0 This Week

Last Update: 2022-08-02
See Project
19

Vote Based Point Recognition

Vote Based Object Detection Framework


Downloads: 0 This Week

Last Update: 2012-07-09
See Project
20

VoteNet

Deep Hough Voting for 3D Object Detection in Point Clouds

VoteNet is a 3D object detection framework for point clouds that combines deep point set networks with a Hough voting mechanism to localize and classify objects in 3D space. It tackles the challenge that object centroids in 3D scenes often don’t lie on any input surface point by having each point “vote” for potential object centers; these votes are then clustered to propose object hypotheses. Once cluster centers are formed, the network regresses bounding boxes around them and classifies them. VoteNet works end-to-end: it learns the voting, aggregation, and bounding-box regression components jointly, enabling strong detection accuracy without relying on 2D proxies or voxelization. The codebase includes data preparation for indoor datasets (SUN RGB-D, ScanNet), training and evaluation scripts, and demo utilities to visualize predicted boxes over point clouds.
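The voting step described above can be illustrated with a toy Python sketch. In VoteNet the per-point offsets are predicted by a learned network; here they are hard-coded inputs, and a simple greedy radius-based grouping stands in for the clustering stage. The function names `cast_votes` and `cluster_votes` are hypothetical and do not come from the VoteNet codebase.

```python
import math


def cast_votes(points, offsets):
    """Each surface point votes for a center: vote = point + offset."""
    return [
        tuple(p + o for p, o in zip(point, offset))
        for point, offset in zip(points, offsets)
    ]


def centroid(members):
    """Mean position of a list of 3D votes."""
    return tuple(sum(axis) / len(members) for axis in zip(*members))


def cluster_votes(votes, radius):
    """Greedy grouping (a stand-in, not VoteNet's method): a vote joins
    the first cluster whose current centroid lies within `radius`,
    otherwise it starts a new cluster. Returns the cluster centroids."""
    clusters = []
    for vote in votes:
        for members in clusters:
            if math.dist(vote, centroid(members)) <= radius:
                members.append(vote)
                break
        else:
            clusters.append([vote])
    return [centroid(members) for members in clusters]


# Four surface points on two objects; the offsets are made up for the demo.
points = [(0, 0, 0), (2, 0, 0), (10, 0, 0), (12, 0, 0)]
offsets = [(1, 0, 0), (-1, 0, 0), (1, 0, 0), (-1, 0, 0)]
centers = cluster_votes(cast_votes(points, offsets), radius=0.5)
```

Note that neither object center coincides with any input point, which is the situation the voting mechanism is meant to handle: the votes land in empty space between surface points and agree with one another there.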

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
21

YOLO ROS

YOLO ROS: Real-Time Object Detection for ROS

This is a ROS package developed for object detection in camera images. You Only Look Once (YOLO) is a state-of-the-art, real-time object detection system. With this ROS package, you can use YOLO (V3) on GPU and CPU. The pre-trained convolutional neural network can detect the classes of the VOC and COCO data sets, or you can create a network with your own detection objects. The YOLO packages have been tested under ROS Noetic and Ubuntu 20.04. We also provide branches that work under ROS Melodic, ROS Foxy and ROS2. Darknet on the CPU is fast (approximately 1.5 seconds on an Intel Core i7-6700HQ CPU @ 2.60GHz × 8), but it is roughly 500 times faster on a GPU. You will need an Nvidia GPU and a CUDA installation. The CMakeLists.txt file automatically detects whether CUDA is installed. CUDA is a parallel computing platform and application programming interface (API) model created by Nvidia.

Downloads: 0 This Week

Last Update: 2022-08-12
See Project
22

histogram-of-oriental-gadient

C++ implementation of the histogram of oriented gradients (HOG) algorithm for object detection, as described in Dalal's doctoral thesis.

Downloads: 0 This Week

Last Update: 2015-08-17
See Project
23

libsombrero

Astronomical object/structure detection from 1D and 2D data sets.

Sombrero is a fast wavelet image processing and object detection C library for astronomical images. Sombrero is named after the "Mexican Hat" shape of the wavelet masks used in image convolution and is released under the GNU LGPL.

Downloads: 0 This Week

Last Update: 2026-02-22
See Project
24

node-opencv

OpenCV Bindings for node.js

OpenCV bindings for Node.js. OpenCV is the de facto computer vision library; by interfacing with it natively in Node, we get powerful real-time vision in JS. People are using node-opencv to fly quadcopters, detect faces in webcam images, and annotate video streams. If you're using it for something cool, I'd love to hear about it! You'll need OpenCV 2.3.1 or newer installed before installing node-opencv. You can use OpenCV to read in image files. Supported formats are listed in the OpenCV docs; JPEGs and other common formats are supported. There is a shortcut method for Viola-Jones Haar Cascade object detection, which can be used for face detection and similar tasks.

Downloads: 0 This Week

Last Update: 2022-01-13
See Project
25

satellite-image-deep-learning

Resources for deep learning with satellite & aerial imagery

This page lists resources for performing deep learning on satellite imagery. To a lesser extent, classical machine learning (e.g. random forests) is also discussed, as are classical image processing techniques. Note that there is a huge volume of academic literature published on these topics, and this repository does not seek to index it all but rather to list approachable resources with published code that will benefit both the research and developer communities. If you find this work useful please give it a star and consider sponsoring it. You can also follow me on Twitter and LinkedIn, where I aim to post frequent updates on my new discoveries, and I have created a dedicated group on LinkedIn. I have also started a blog and have published a post on the history of this repository called Dissecting the satellite-image-deep-learning repo.

Downloads: 0 This Week

Last Update: 2025-07-05
See Project