video detection free download

Showing 28 open source projects for "video detection"

View related business solutions

Software Development Clear Filters & Widen Search

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitle (hardsub) from videos

Video hard subtitle extraction, generate srt file. There is no need to apply for a third-party API, and text recognition can be implemented locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.

1 Review

Downloads: 66 This Week

Last Update: 2026-04-05
See Project
2

MediaPipe

Cross-platform, customizable ML solutions for live and streaming media

MediaPipe offers open-source cross-platform, customizable ML solutions for live and streaming media. Provides segmentation masks for prominent humans in the scene. MediaPipe Face Mesh is a face geometry solution that estimates 468 3D face landmarks in real-time even on mobile devices. It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU...

Downloads: 84 This Week

Last Update: 2026-04-23
See Project
3

trackers

Multi-object tracking algorithms

trackers is a plug-and-play multi-object tracking library designed to work with virtually any object detection model, enabling developers to follow objects across video frames with minimal setup. The library provides clean, modular implementations of leading tracking algorithms and can be used either from the command line or embedded directly into Python pipelines. It supports inputs such as videos, webcams, RTSP streams, or image directories and produces annotated tracking outputs that include labels and trajectories. ...

Downloads: 0 This Week

Last Update: 2026-03-16
See Project
4

BizHawk

BizHawk is a multi-system emulator written in C#

A multi-system emulator written in C#. As well as quality-of-life features for casual players, it also has recording/playback and debugging tools, making it the first choice for TASers (Tool-Assisted Speedrunners). Screenshotting and recording audio + video to file. Firmware management, input, framerate, and more in a HUD over the game. Rebindable hotkeys for controlling the frontend (keyboard+mouse+gamepad). A comprehensive input mapper for the emulated gamepads and other peripherals....

Downloads: 49 This Week

Last Update: 2025-09-20
See Project
8 Monitoring Tools in One APM. Install in 5 Minutes.
Errors, performance, logs, uptime, hosts, anomalies, dashboards, and check-ins. One interface.

AppSignal works out of the box for Ruby, Elixir, Node.js, Python, and more. 30-day free trial, no credit card required.

Start Free
5

OmniPull

Just pull anything

OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.

Downloads: 1 This Week

Last Update: 2026-04-13
See Project
6

Darknet YOLO

Real-Time Object Detection for Windows and Linux

This is YOLO-v3 and v2 for Windows and Linux. YOLO (You only look once) is a state-of-the-art, real-time object detection system of Darknet, an open source neural network framework in C. YOLO is extremely fast and accurate. It uses a single neural network to divide a full image into regions, and then predicts bounding boxes and probabilities for each region. This project is a fork of the original Darknet project.

Downloads: 29 This Week

Last Update: 2024-01-29
See Project
7

AugLy

A data augmentations library for audio, image, text, and video

...We designed AugLy to include many specific data augmentations that users perform in real life on internet platforms like Facebook's -- for example making an image into a meme, overlaying text/emojis on images/videos, reposting a screenshot from social media. While AugLy contains more generic data augmentations as well, it will be particularly useful to you if you're working on a problem like copy detection, hate speech detection, etc.

Downloads: 0 This Week

Last Update: 2022-03-29
See Project
8

jeelizFaceFilter

Javascript/WebGL lightweight face tracking library

This JavaScript library detects and tracks the face in real time from the camera video feed captured with WebRTC. Then it is possible to overlay 3D content for augmented reality applications. We provide various demonstrations using main WebGL 3D engines. We have included in this repository the release versions of the 3D engines to work with a determined version (they are in /libs/<name of the engine>/). This library is lightweight and it does not include any 3D engine or third-party library....

Downloads: 0 This Week

Last Update: 2022-02-21
See Project
9

PyTorchVideo

A deep learning library for video understanding research

PyTorchVideo is a deep learning library for video understanding, providing modular components and pretrained models for tasks like action recognition, video classification, detection, and self-supervised learning. It is tightly integrated with PyTorch and PyTorch Lightning, offering flexible APIs for building and training spatiotemporal networks. The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. ...

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
10

UniVL

Official implementation for UniVL video and language training models

UniVL is a video-language pretrain model. It is designed with four modules and five objectives for both video language understanding and generation tasks. It is also a flexible model for most of the multimodal downstream tasks considering both efficiency and effectiveness.

Downloads: 0 This Week

Last Update: 2024-07-12
See Project
11

Gluon CV Toolkit

Gluon CV Toolkit

...It features training scripts that reproduce SOTA results reported in latest papers, a large set of pre-trained models, carefully designed APIs and easy-to-understand implementations and community support. From fundamental image classification, object detection, semantic segmentation and pose estimation, to instance segmentation and video action recognition. The model zoo is the one-stop shopping center for many models you are expecting. GluonCV embraces a flexible development pattern while is super easy to optimize and deploy without retaining a heavyweight deep learning framework.

Downloads: 0 This Week

Last Update: 2021-11-01
See Project
12

OpenPose

Real-time multi-person keypoint detection library for body, face, etc.

OpenPose has represented the first real-time multi-person system to jointly detect human body, hand, facial, and foot keypoints (in total 135 keypoints) on single images. It is authored by Ginés Hidalgo, Zhe Cao, Tomas Simon, Shih-En Wei, Yaadhav Raaj, Hanbyul Joo, and Yaser Sheikh. It is maintained by Ginés Hidalgo and Yaadhav Raaj. OpenPose would not be possible without the CMU Panoptic Studio dataset. We would also like to thank all the people who has helped OpenPose in any way. 15, 18 or...

Downloads: 18 This Week

Last Update: 2022-07-28
See Project
13

XIVO

X Inertial-aided Visual Odometry

XIVO is an open-source repository for visual-inertial odometry/mapping. It is a simplified version of Corvis [Jones et al.,Tsotsos et al.], designed for pedagogical purposes, and incorporates odometry (relative motion of the sensor platform), local mapping (pose relative to a reference frame of the oldest visible features), and global mapping (pose relative to a global frame, including loop-closure and global re-localization, this feature, present in Corvis, is not yet incorporated in XIVO)....

Downloads: 1 This Week

Last Update: 2024-07-11
See Project
14

Computer Vision

Best Practices, code samples, and documentation for Computer Vision

In recent years, we've see an extra-ordinary growth in Computer Vision, with applications in face recognition, image understanding, search, drones, mapping, semi-autonomous and autonomous vehicles. A key part to many of these applications are visual recognition tasks such as image classification, object detection and image similarity. This repository provides examples and best practice guidelines for building computer vision systems. The goal of this repository is to build a comprehensive...

Downloads: 0 This Week

Last Update: 2021-07-20
See Project
15

GPUImage 2

Framework for GPU-accelerated video and image processing

...By relying on the GPU to run these operations, performance improvements of 100X or more over CPU-bound code can be realized. This is particularly noticeable in mobile or embedded devices. On an iPhone 4S, this framework can easily process 1080p video at over 60 FPS. On a Raspberry Pi 3, it can perform Sobel edge detection on live 720p video at over 20 FPS.

Downloads: 7 This Week

Last Update: 2021-11-17
See Project
16

ftdetector

File type detector library

This project is a tool to detect file types by signatures and mime types. It uses hash tables to make the detection of a file type as fast as possible. The signature and mime types lists are stored at simple user-friendly files. This file type detector supports a lot of formats (image, archive, text, documents, audio, video, fonts and others). It also includes Microsoft OLE compound file types. The detector's algorythm has special features to detect text file types like (HTML, XML, JSON, etc.) and text file UNICODE charsets by using their BOMs (Byte order marks). ...

Downloads: 0 This Week

Last Update: 2019-04-08
See Project
17

Simd

High performance image processing library in C++

The Simd Library is a free open source image processing library, designed for C and C++ programmers. It provides many useful high performance algorithms for image processing such as: pixel format conversion, image scaling and filtration, extraction of statistic information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, neural network. The algorithms are optimized with using of different SIMD CPU extensions. In particular the library...

3 Reviews

Downloads: 34 This Week

Last Update: 2019-02-01
See Project
18

OpenAuto

AndroidAuto headunit emulator

OpenAuto is an AndroidAuto(tm) headunit emulator based on aasdk library and Qt libraries. Main goal is to run this application on the RaspberryPI 3 board computer smoothly. The OpenAuto Pro is the most advanced Raspberry Pi-based, custom head-unit solution ready to retrofit your vehicle. The main functionality of the OpenAuto Pro software is to bring Users access to modern head-unit features like Bluetooth Hands-Free Profile, music streaming, integrated media player, navigation via Android...

Downloads: 40 This Week

Last Update: 2022-06-01
See Project
19

clmtrackr

Javascript library for precise tracking of facial features

clmtrackr is a javascript library for fitting facial models to faces in videos or images. It currently is an implementation of constrained local models fitted by regularized landmark mean-shift, as described in Jason M. Saragih's paper. clmtrackr tracks a face and outputs the coordinate positions of the face model as an array. The library provides some generic face models that were trained on the MUCT database and some additional self-annotated images. Check out clmtools for building your...

Downloads: 0 This Week

Last Update: 2021-09-02
See Project
20

node-opencv

OpenCV Bindings for node.js

...This can be used for face detection etc.

Downloads: 0 This Week

Last Update: 2022-01-13
See Project
21

GPUImage

iOS framework for GPU-based image and video processing

The GPUImage framework is a BSD-licensed iOS library that lets you apply GPU-accelerated filters and other effects to images, live camera video, and movies. In comparison to Core Image (part of iOS 5.0), GPUImage allows you to write your own custom filters, supports deployment to iOS 4.0, and has a slightly simpler interface. However, it currently lacks some of the more advanced features of Core Image, such as facial detection. GPUImage uses OpenGL ES 2.0 shaders to perform image and video manipulation much faster than could be done in CPU-bound routines. ...

Downloads: 0 This Week

Last Update: 2023-02-09
See Project
22

ANNGD

Artificial neural network gender detection

ANNGD is a artificial neural network gender detection application. The detection is made in real time images captured by webcam by opencv library. The package include applications to image pre-processing and artificial neural network back-propagation training.

Downloads: 0 This Week

Last Update: 2015-05-26
See Project
23

Melon

Kinect based controller.

Melon's purpose is to simplify as much as possible the setup of interactive installation (art, cultural, commercial, etc) as much as possible. It broadcast in OSC 3D pointers. Each pointer can be attached to any of the bone of the skeleton. The interaction area defined for this pointer is attached to another bone of the skeleton, therefore the visitor is free to move in the space defined. Video tutorial: https://vimeo.com/73297050 Developped by http://www.numediart.org [!] >...

2 Reviews

Downloads: 0 This Week

Last Update: 2014-07-24
See Project
24

GP2X Hardware Library

Library which allows developper to access the hardware features of GP2X (personal entertainment player created by a Korean company: GPH). Software emulation is provided on PC through SDL.

Downloads: 1 This Week

Last Update: 2015-11-23
See Project
25

BayesianCortex

simple algorithm for a realtime interactive visual cortex for painting

A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by changing what you painted gradually. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 childs because that is all thats needed to do NAND in a fuzzy way as Bayes' Rule which is NAND at certain extremes. NAND can be...

Downloads: 0 This Week

Last Update: 2013-05-30
See Project