Showing 28 open source projects for "video detection"

  • 1
    Video-subtitle-extractor

    A GUI tool for extracting hard-coded subtitles (hardsubs) from videos

    Extracts hard-coded subtitles (hardsubs) from videos and generates SRT files. Text recognition runs locally, so there is no need to apply for a third-party API. The tool is built on a deep learning-based video subtitle extraction framework that covers subtitle region detection and subtitle content extraction.
    Downloads: 66 This Week
  • 2
    MediaPipe

    Cross-platform, customizable ML solutions for live and streaming media

    MediaPipe offers open-source cross-platform, customizable ML solutions for live and streaming media. Provides segmentation masks for prominent humans in the scene. MediaPipe Face Mesh is a face geometry solution that estimates 468 3D face landmarks in real-time even on mobile devices. It employs machine learning (ML) to infer the 3D surface geometry, requiring only a single camera input without the need for a dedicated depth sensor. Utilizing lightweight model architectures together with GPU...
    Downloads: 84 This Week
  • 3
    trackers

    Multi-object tracking algorithms

    trackers is a plug-and-play multi-object tracking library designed to work with virtually any object detection model, enabling developers to follow objects across video frames with minimal setup. The library provides clean, modular implementations of leading tracking algorithms and can be used either from the command line or embedded directly into Python pipelines. It supports inputs such as videos, webcams, RTSP streams, or image directories and produces annotated tracking outputs that include labels and trajectories. ...
    Downloads: 0 This Week
  • 4
    BizHawk

    BizHawk is a multi-system emulator written in C#

    A multi-system emulator written in C#. As well as quality-of-life features for casual players, it also has recording/playback and debugging tools, making it the first choice for TASers (Tool-Assisted Speedrunners). Screenshotting and recording audio + video to file. Firmware management, input, framerate, and more in a HUD over the game. Rebindable hotkeys for controlling the frontend (keyboard+mouse+gamepad). A comprehensive input mapper for the emulated gamepads and other peripherals....
    Downloads: 49 This Week
  • 5
    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 1 This Week
  • 6
    Darknet YOLO

    Real-Time Object Detection for Windows and Linux

    This is YOLO-v3 and v2 for Windows and Linux. YOLO (You only look once) is a state-of-the-art, real-time object detection system of Darknet, an open source neural network framework in C. YOLO is extremely fast and accurate. It uses a single neural network to divide a full image into regions, and then predicts bounding boxes and probabilities for each region. This project is a fork of the original Darknet project.
    Downloads: 29 This Week
  • 7
    AugLy

    A data augmentations library for audio, image, text, and video

    ...We designed AugLy to include many specific data augmentations that users perform in real life on internet platforms like Facebook's -- for example making an image into a meme, overlaying text/emojis on images/videos, reposting a screenshot from social media. While AugLy contains more generic data augmentations as well, it will be particularly useful to you if you're working on a problem like copy detection, hate speech detection, etc.
    Downloads: 0 This Week
  • 8
    jeelizFaceFilter

    Javascript/WebGL lightweight face tracking library

    This JavaScript library detects and tracks the face in real time from the camera video feed captured with WebRTC. Then it is possible to overlay 3D content for augmented reality applications. We provide various demonstrations using main WebGL 3D engines. We have included in this repository the release versions of the 3D engines to work with a determined version (they are in /libs/<name of the engine>/). This library is lightweight and it does not include any 3D engine or third-party library....
    Downloads: 0 This Week
  • 9
    PyTorchVideo

    A deep learning library for video understanding research

    PyTorchVideo is a deep learning library for video understanding, providing modular components and pretrained models for tasks like action recognition, video classification, detection, and self-supervised learning. It is tightly integrated with PyTorch and PyTorch Lightning, offering flexible APIs for building and training spatiotemporal networks. The library includes efficient implementations of state-of-the-art architectures such as SlowFast, X3D, and MViT, optimized for both research prototyping and production inference. ...
    Downloads: 0 This Week
  • 10
    UniVL

    Official implementation for UniVL video and language training models

    UniVL is a video-language pre-training model. It is designed with four modules and five objectives for both video-language understanding and generation tasks. It is also flexible enough for most multimodal downstream tasks, balancing efficiency and effectiveness.
    Downloads: 0 This Week
  • 11
    Gluon CV Toolkit

    Gluon CV Toolkit

    ...It features training scripts that reproduce SOTA results reported in the latest papers, a large set of pre-trained models, carefully designed APIs, easy-to-understand implementations, and community support. It covers tasks from fundamental image classification, object detection, semantic segmentation, and pose estimation to instance segmentation and video action recognition. The model zoo is a one-stop shop for many of the models you might expect. GluonCV embraces a flexible development pattern while remaining easy to optimize and deploy without retaining a heavyweight deep learning framework.
    Downloads: 0 This Week
  • 12
    OpenPose

    Real-time multi-person keypoint detection library for body, face, etc.

    OpenPose is the first real-time multi-person system to jointly detect human body, hand, facial, and foot keypoints (135 keypoints in total) on single images. It is authored by Ginés Hidalgo, Zhe Cao, Tomas Simon, Shih-En Wei, Yaadhav Raaj, Hanbyul Joo, and Yaser Sheikh. It is maintained by Ginés Hidalgo and Yaadhav Raaj. OpenPose would not be possible without the CMU Panoptic Studio dataset. We would also like to thank all the people who have helped OpenPose in any way. 15, 18 or...
    Downloads: 18 This Week
  • 13
    XIVO

    X Inertial-aided Visual Odometry

    XIVO is an open-source repository for visual-inertial odometry/mapping. It is a simplified version of Corvis [Jones et al., Tsotsos et al.], designed for pedagogical purposes, and incorporates odometry (relative motion of the sensor platform), local mapping (pose relative to a reference frame of the oldest visible features), and global mapping (pose relative to a global frame, including loop-closure and global re-localization; this feature, present in Corvis, is not yet incorporated in XIVO)....
    Downloads: 1 This Week
  • 14
    Computer Vision

    Best Practices, code samples, and documentation for Computer Vision

    In recent years, we've seen extraordinary growth in Computer Vision, with applications in face recognition, image understanding, search, drones, mapping, and semi-autonomous and autonomous vehicles. Key to many of these applications are visual recognition tasks such as image classification, object detection, and image similarity. This repository provides examples and best practice guidelines for building computer vision systems. The goal of this repository is to build a comprehensive...
    Downloads: 0 This Week
  • 15
    GPUImage 2

    Framework for GPU-accelerated video and image processing

    ...By relying on the GPU to run these operations, performance improvements of 100X or more over CPU-bound code can be realized. This is particularly noticeable in mobile or embedded devices. On an iPhone 4S, this framework can easily process 1080p video at over 60 FPS. On a Raspberry Pi 3, it can perform Sobel edge detection on live 720p video at over 20 FPS.
    Downloads: 7 This Week
  • 16
    ftdetector

    File type detector library

    This project is a tool to detect file types by signatures and MIME types. It uses hash tables to make file type detection as fast as possible. The signature and MIME type lists are stored in simple, user-friendly files. The detector supports many formats (image, archive, text, document, audio, video, font, and others), including Microsoft OLE compound file types. Its algorithm has special features to detect text file types such as HTML, XML, and JSON, and to detect the Unicode charset of a text file from its BOM (byte order mark). ...
    Downloads: 0 This Week
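
The signature-based approach described for ftdetector can be illustrated with a minimal Python sketch. This is not ftdetector's code or API: the table contents, the function names (detect_type, detect_bom_charset), and the choice to probe a dictionary once per distinct signature length are all assumptions made for illustration.

```python
from typing import Optional, Tuple

# Hypothetical signature table: leading "magic" bytes -> (type name, MIME type).
# A real detector would load a much larger list from its data files.
SIGNATURES = {
    b"\x89PNG\r\n\x1a\n": ("PNG image", "image/png"),
    b"\xff\xd8\xff": ("JPEG image", "image/jpeg"),
    b"%PDF-": ("PDF document", "application/pdf"),
    b"PK\x03\x04": ("ZIP archive", "application/zip"),
    b"\x1a\x45\xdf\xa3": ("Matroska/WebM video", "video/x-matroska"),
}

# Distinct signature lengths, longest first, so each probe is one hash lookup.
SIGNATURE_LENGTHS = sorted({len(sig) for sig in SIGNATURES}, reverse=True)

# Byte order marks, longest first so UTF-32 LE is not mistaken for UTF-16 LE.
BOMS = [
    (b"\xff\xfe\x00\x00", "UTF-32 LE"),
    (b"\x00\x00\xfe\xff", "UTF-32 BE"),
    (b"\xef\xbb\xbf", "UTF-8"),
    (b"\xff\xfe", "UTF-16 LE"),
    (b"\xfe\xff", "UTF-16 BE"),
]


def detect_type(data: bytes) -> Optional[Tuple[str, str]]:
    """Return (type name, MIME type) for known leading bytes, else None."""
    for length in SIGNATURE_LENGTHS:
        hit = SIGNATURES.get(data[:length])  # dictionary (hash table) lookup
        if hit is not None:
            return hit
    return None


def detect_bom_charset(data: bytes) -> Optional[str]:
    """Return the Unicode encoding implied by a leading BOM, else None."""
    for bom, name in BOMS:
        if data.startswith(bom):
            return name
    return None


if __name__ == "__main__":
    sample = b"%PDF-1.7 example bytes"
    print(detect_type(sample))
    print(detect_bom_charset(b"\xef\xbb\xbf<html>"))
```

Probing by prefix length keeps every check a constant-time dictionary lookup instead of a scan over all signatures. Formats whose signature sits at a non-zero offset would need extra handling that this sketch leaves out.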
  • 17
    Simd

    High performance image processing library in C++

    The Simd Library is a free open source image processing library, designed for C and C++ programmers. It provides many useful high performance algorithms for image processing, such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using different SIMD CPU extensions. In particular the library...
    Downloads: 34 This Week
  • 18
    OpenAuto

    AndroidAuto headunit emulator

    OpenAuto is an AndroidAuto(tm) headunit emulator based on the aasdk library and Qt libraries. The main goal is to run this application smoothly on the Raspberry Pi 3 board computer. OpenAuto Pro is the most advanced Raspberry Pi-based, custom head-unit solution ready to retrofit your vehicle. The main functionality of the OpenAuto Pro software is to give users access to modern head-unit features like Bluetooth Hands-Free Profile, music streaming, integrated media player, navigation via Android...
    Downloads: 40 This Week
  • 19
    clmtrackr

    Javascript library for precise tracking of facial features

    clmtrackr is a javascript library for fitting facial models to faces in videos or images. It currently is an implementation of constrained local models fitted by regularized landmark mean-shift, as described in Jason M. Saragih's paper. clmtrackr tracks a face and outputs the coordinate positions of the face model as an array. The library provides some generic face models that were trained on the MUCT database and some additional self-annotated images. Check out clmtools for building your...
    Downloads: 0 This Week
  • 20
    node-opencv

    OpenCV Bindings for node.js

    ...This can be used for face detection etc.
    Downloads: 0 This Week
  • 21
    GPUImage

    iOS framework for GPU-based image and video processing

    The GPUImage framework is a BSD-licensed iOS library that lets you apply GPU-accelerated filters and other effects to images, live camera video, and movies. In comparison to Core Image (part of iOS 5.0), GPUImage allows you to write your own custom filters, supports deployment to iOS 4.0, and has a slightly simpler interface. However, it currently lacks some of the more advanced features of Core Image, such as facial detection. GPUImage uses OpenGL ES 2.0 shaders to perform image and video manipulation much faster than could be done in CPU-bound routines. ...
    Downloads: 0 This Week
  • 22
    ANNGD

    Artificial neural network gender detection

    ANNGD is an artificial neural network gender detection application. Detection runs in real time on images captured from a webcam using the OpenCV library. The package includes applications for image pre-processing and for back-propagation training of the artificial neural network.
    Downloads: 0 This Week
  • 23
    Melon

    Kinect based controller.

    Melon's purpose is to simplify the setup of interactive installations (art, cultural, commercial, etc.) as much as possible. It broadcasts 3D pointers over OSC. Each pointer can be attached to any bone of the skeleton. The interaction area defined for a pointer is attached to another bone of the skeleton, so the visitor is free to move within the defined space. Video tutorial: https://vimeo.com/73297050 Developed by http://www.numediart.org ...
    Downloads: 0 This Week
  • 24
    GP2X Hardware Library
    A library that allows developers to access the hardware features of the GP2X (a personal entertainment player created by the Korean company GPH). Software emulation is provided on PC through SDL.
    Downloads: 1 This Week
  • 25
    BayesianCortex

    simple algorithm for a realtime interactive visual cortex for painting

    A paint program where the canvas is the visual cortex of a simple kind of artificial intelligence. You paint with the mouse into its dreams and it responds by gradually changing what you painted. There will also be an API for using it with other programs as a general high-dimensional space. Each pixel's brightness is its own dimension. Bayesian nodes have exactly 3 children because that is all that's needed to do NAND in a fuzzy way as Bayes' Rule, which is NAND at certain extremes. NAND can be...
    Downloads: 0 This Week