detection free download

158 projects for "detection" with 2 filters applied:

Artificial Intelligence BSD Clear Filters & Widen Search

Atera - an All-in-one platform for IT management
Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now
Auth0 B2B Essentials: SSO, MFA, and RBAC Built In
Unlimited organizations, 3 enterprise SSO connections, role-based access control, and pro MFA included. Dev and prod tenants out of the box.

Auth0's B2B Essentials plan gives you everything you need to ship secure multi-tenant apps. Unlimited orgs, enterprise SSO, RBAC, audit log streaming, and higher auth and API limits included. Add on M2M tokens, enterprise MFA, or additional SSO connections as you scale.

Sign Up Free
1

Anomaly Detection Learning Resources

Anomaly detection related books, papers, videos, and toolboxes

Anomaly Detection Learning Resources is a curated open-source repository that collects educational materials, tools, and academic references related to anomaly detection and outlier analysis in data science. The project serves as a centralized index for researchers and practitioners who want to explore algorithms, datasets, and publications associated with detecting unusual patterns in data.

Downloads: 0 This Week

Last Update: 2026-03-04
See Project
2

Frigate

NVR with realtime local object detection for IP cameras

Frigate - NVR With Realtime Object Detection for IP Cameras A complete and local NVR designed for Home Assistant with AI object detection. Uses OpenCV and Tensorflow to perform realtime object detection locally for IP cameras. Use of a Google Coral Accelerator is optional, but highly recommended. The Coral will outperform even the best CPUs and can process 100+ FPS with very little overhead.

Downloads: 40 This Week

Last Update: 2026-03-19
See Project
3

RF-DETR

RF-DETR is a real-time object detection and segmentation

RF-DETR is an open-source computer vision framework that implements a real-time object detection and instance segmentation model based on transformer architectures. Developed by Roboflow, the project builds upon modern vision transformer backbones such as DINOv2 to achieve strong accuracy while maintaining efficient inference speeds suitable for real-time applications. The model is designed to detect objects and segment them within images or video streams using a unified detection pipeline. ...

Downloads: 2 This Week

Last Update: 2026-05-28
See Project
4

CutLER

Code release for Cut and Learn for Unsupervised Object Detection

CutLER is an approach for unsupervised object detection and instance segmentation that trains detectors without human-annotated labels, and the repo also includes VideoCutLER for unsupervised video instance segmentation. The method follows a “Cut-and-LEaRn” recipe: bootstrap object proposals, refine them iteratively, and train detection/segmentation heads to discover objects across diverse datasets.

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
5

RealtimeSTT

A robust, efficient, low-latency speech-to-text library

RealtimeSTT is a Python-based realtime speech-to-text engine emphasizing low latency, wake-word detection, voice activity detection, and automatic speech segmentation. It provides asynchronous callbacks, nanosecond-precision timestamps, and CLI tools, suitable for building voice assistants, meeting transcribers, or live caption systems.

Downloads: 2 This Week

Last Update: 2026-05-31
See Project
6

GeoAI

GeoAI: Artificial Intelligence for Geospatial Data

...It provides a unified framework that combines machine learning libraries such as PyTorch and Transformers with geospatial tools, allowing users to process satellite imagery, aerial photos, and vector datasets in a streamlined workflow. The platform supports a wide range of tasks including image classification, object detection, segmentation, and change detection, making it suitable for applications in environmental monitoring, urban planning, and disaster response. GeoAI simplifies complex workflows by offering high-level APIs that abstract data preprocessing, model training, and inference, reducing the technical barrier for users who are not experts in both AI and geospatial systems.

Downloads: 6 This Week

Last Update: 2026-06-02
See Project
7

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion

Grounded-Segment-Anything is a research-oriented project that combines powerful open-set object detection with pixel-level segmentation and subsequent creative workflows, effectively enabling detection, segmentation, and high-level vision tasks guided by free-form text prompts. The core idea behind the project is to pair Grounding DINO — a zero-shot object detector that can locate objects described by natural language — with Segment Anything Model (SAM), which can produce detailed masks for objects once they are localized. ...

Downloads: 0 This Week

Last Update: 2026-02-03
See Project
8

Frigate NVR

NVR with realtime local object detection for IP cameras

Frigate is a local network video recorder designed for real-time object detection on IP camera streams using machine learning. It runs entirely on local hardware and integrates closely with Home Assistant to provide smart surveillance without relying on cloud processing. The system uses OpenCV and TensorFlow to analyze video feeds and detect objects such as people, vehicles, and animals in real time. Frigate is optimized for efficiency and supports hardware acceleration across a wide range of devices, including GPUs and specialized inference hardware. ...

Downloads: 4 This Week

Last Update: 2026-03-19
See Project
9

BoxMOT

Pluggable SOTA multi-object tracking modules for segmentation

BoxMOT is an open-source framework designed to provide modular implementations of state-of-the-art multi-object tracking algorithms for computer vision applications. The project focuses on the tracking-by-detection paradigm, where objects detected by vision models are continuously tracked across frames in a video sequence. It provides a pluggable architecture that allows developers to combine different object detectors with multiple tracking algorithms without modifying the core codebase. The framework supports integration with detection, segmentation, and pose estimation models that produce bounding box outputs. ...

Downloads: 0 This Week

Last Update: 2026-06-03
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
10

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

...A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stages of a traditional speech-processing pipeline. The multitask training format uses a set of special tokens that serve as task specifiers or classification targets.

Downloads: 74 This Week

Last Update: 2025-06-26
See Project
11

claude-obsidian

Claude + Obsidian knowledge companion

...The system follows the LLM Wiki pattern, where information is stored as persistent markdown files that grow richer over time through cross-referencing and synthesis. It includes features such as contradiction detection, orphaned note identification, and automatic indexing. A persistent memory layer ensures continuity across sessions, eliminating the need for repeated context. It also performs autonomous research to fill knowledge gaps and expand the knowledge base. Overall, it turns note-taking into an active, compounding intelligence system.

Downloads: 4 This Week

Last Update: 2026-05-28
See Project
12

ComfyUI-Copilot

AI assistant for ComfyUI workflow generation, debugging, and tuning

...ComfyUI-Copilot leverages large language model capabilities to analyze user intent, recommend nodes, and suggest models that match specific requirements. It also provides automated error detection and repair suggestions, improving reliability during development.

Downloads: 4 This Week

Last Update: 2026-03-18
See Project
13

SenseVoice

Multilingual speech recognition and audio understanding model

SenseVoice is a speech foundation model designed to perform multiple voice understanding tasks from audio input. It provides capabilities such as automatic speech recognition, spoken language identification, speech emotion recognition, and audio event detection within a single system. SenseVoice is trained on more than 400,000 hours of speech data and supports over 50 languages for multilingual recognition tasks. It is built to achieve high transcription accuracy while maintaining efficient inference performance. It includes different model variants optimized for either speed or accuracy, allowing developers to choose a configuration suitable for their use case. ...

Downloads: 5 This Week

Last Update: 2026-05-31
See Project
14

Trail of Bits Skills Marketplace

Trail of Bits Claude Code skills for security research, vulnerability

Trail of Bits Skills Marketplace is a specialized Claude Code skills marketplace built by the security research firm Trail of Bits that focuses on enhancing AI-assisted workflows for vulnerability discovery, testing, and secure development. The repository groups a set of plug-in skills tailored toward static analysis, code auditing, secure defaults detection, and other practices that matter in software security. Users can easily add the marketplace to a Claude Code environment, browse available plugins, and install specific skills for tasks like automatic Semgrep rule creation, entry-point analysis in smart contracts, or insecure defaults detection. This project leverages the agent skills architecture to let AI assistants take on detailed, repeatable security procedures that are typically manual, such as parsing Burp Suite projects or conducting variant analysis across codebases.

Downloads: 5 This Week

Last Update: 6 days ago
See Project
15

AI YouTube Shorts Generator

A python tool that uses GPT-4, FFmpeg, and OpenCV

...It analyzes input video (whether a local file or a YouTube URL), transcribes audio (with optional GPU-accelerated speech-to-text), uses an AI model to identify the most compelling or engaging segments, and then crops/resizes the video and applies subtitle overlays, producing a polished short video without manual editing. The tool streamlines multiple steps of the tedious short-form video workflow: highlight detection, clipping, subtitle generation, cropping to vertical 9:16 format, and final rendering — reducing hours of editing to a mostly automated pipeline. Because it supports both local and online video sources, it's flexible whether you're working with your own recorded content or repurposing existing longer-form videos.

Downloads: 11 This Week

Last Update: 6 days ago
See Project
16

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification

...It also includes utilities for visualizing audio features and analyzing patterns within sound recordings, which can be useful in applications such as speech recognition, music classification, and acoustic event detection. Because the library integrates machine learning algorithms with signal processing tools, it enables researchers to develop complete audio analysis pipelines using a single framework.

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
17

Ultralytics

Ultralytics YOLO

Ultralytics is a comprehensive computer vision framework that provides state-of-the-art implementations of the YOLO (You Only Look Once) family of models, enabling developers to perform tasks such as object detection, segmentation, classification, tracking, and pose estimation within a unified system. It is designed to be fast, accurate, and easy to use, offering both command-line and Python-based interfaces for training, validation, and deployment of machine learning models. The framework supports a full end-to-end workflow, including dataset preparation, model training, evaluation, and export to various deployment formats. ...

Downloads: 1 This Week

Last Update: 18 hours ago
See Project
18

The Machine & Deep Learning Compendium

List of references in my private & single document

...Originally created as a personal knowledge base, the repository evolved into a public educational resource designed to help learners explore the rapidly expanding machine learning ecosystem. The compendium includes explanations of concepts across multiple domains such as natural language processing, computer vision, time-series analysis, anomaly detection, and graph learning. In addition to technical algorithms, the project also covers practical topics related to data science workflows, engineering practices, and product development in AI systems.

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
19

Interactive Machine Learning Experiments

Interactive Machine Learning experiments

...The project combines Jupyter or Colab notebooks with browser-based visual demos that allow users to see trained models operating in real time. Many experiments involve tasks such as image classification, object detection, gesture recognition, and simple generative models. The models are typically trained in Python using TensorFlow and then exported for interactive demonstrations in a web environment using JavaScript and TensorFlow.js. Because the project focuses on experimentation rather than production systems, it acts as a sandbox where developers can explore machine learning concepts and observe model behavior. ...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
20

llmfit

157 models, 30 providers, one command to find what runs on hardware

llmfit is a terminal-based utility that helps developers determine which large language models can realistically run on their local hardware by analyzing system resources and model requirements. The tool automatically detects CPU, RAM, GPU, and VRAM specifications, then ranks available models based on performance factors such as speed, quality, and memory fit. It provides both an interactive terminal user interface and a traditional CLI mode, enabling flexible workflows for different user...

Downloads: 29 This Week

Last Update: 7 days ago
See Project
21

imgclsmob Deep learning networks

Sandbox for training deep learning networks

...The project serves as a sandbox for training and evaluating a wide variety of neural network architectures used in image analysis. It includes implementations of models used for tasks such as image classification, object detection, semantic segmentation, and pose estimation. The repository also contains scripts that help train models, evaluate performance, and convert trained networks between different frameworks. Several deep learning frameworks are supported, allowing researchers to experiment with architectures in different environments. The project is frequently used by developers who want to study modern convolutional neural network designs and compare their performance across datasets.

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
22

Open Vision Agents by Stream

Build Vision Agents quickly with any model or video provider

...Developers work with an agent abstraction that connects video edge providers, LLMs, and processors into pipelines, making it easier to orchestrate tasks like object detection, pose estimation, and conversational guidance. The project includes SDKs for React, Android, iOS, Flutter, React Native, and Unity, enabling integration into a wide variety of client environments such as mobile apps, web apps, and games.

Downloads: 1 This Week

Last Update: 2026-06-09
See Project
23

Advanced AI explainability for PyTorch

Advanced AI Explainability for computer vision

...These visualization techniques allow developers and researchers to better understand how convolutional neural networks and transformer-based vision models make predictions. The library supports a wide variety of tasks including image classification, object detection, semantic segmentation, and similarity analysis. It also provides metrics and evaluation tools that help measure the reliability and quality of the generated explanations. By integrating easily with PyTorch models, the library allows developers to diagnose model errors, detect biases in datasets, and improve model transparency.

Downloads: 0 This Week

Last Update: 2026-05-21
See Project
24

MediaPipe Solutions

Cross-platform, customizable ML solutions

...These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.

Downloads: 0 This Week

Last Update: 2026-04-23
See Project
25

Youtu-GraphRAG

Vertically Unified Agents for Graph Retrieval-Augmented Reasoning

...These structures allow the system to perform multi-hop reasoning by decomposing complex questions into smaller queries that can be executed across different parts of the graph. The framework also incorporates hierarchical community detection algorithms that organize knowledge into clusters, improving both retrieval efficiency and reasoning performance. In addition to graph construction and retrieval, the system integrates iterative reasoning techniques that refine answers through multiple retrieval and reasoning cycles.

Downloads: 0 This Week

Last Update: 2026-03-09
See Project