Showing 51 open source projects for "edge detection"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    DeepCamera

    DeepCamera

    Open-Source AI Camera. Empower any camera/CCTV

    DeepCamera empowers your traditional surveillance cameras and CCTV/NVR with machine learning technologies. It provides open-source facial recognition-based intrusion detection, fall detection, and parking lot monitoring with the inference engine on your local device. SharpAI-hub is the cloud hosting for AI applications that helps you deploy AI applications with your CCTV camera on your edge device in minutes. SharpAI yolov7_reid is an open-source Python application that leverages AI technologies to detect intruders with traditional surveillance cameras. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 2
    Anomalib

    Anomalib

    An anomaly detection library comprising state-of-the-art algorithms

    Anomalib is an open-source deep learning library focused on anomaly detection and localization tasks, collecting state-of-the-art algorithms and tools under one modular framework. It provides implementations of leading anomaly detection methods drawn from current research, as well as a full set of utilities for training, evaluating, benchmarking, and deploying these models on both public and private datasets.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    Open Vision Agents by Stream

    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    ...It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio and video latency while processing frames and generating responses. Developers work with an agent abstraction that connects video edge providers, LLMs, and processors into pipelines, making it easier to orchestrate tasks like object detection, pose estimation, and conversational guidance. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    TEN

    TEN

    Open-source framework for conversational voice AI agents

    TEN (Transformative Extensions Network) is an open source framework designed to empower developers to build real-time multimodal AI agents capable of voice, video, text, image, and data-stream interaction with ultra-low latency. It includes a full ecosystem, TEN Turn Detection, TEN Agent, and TMAN Designer, allowing developers to rapidly assemble human-like, responsive agents that can see, speak, hear, and interact. With support for languages like Python, C++, and Go, it offers flexible deployment on both edge and cloud environments. Using components like graph-based workflow design, drag-and-drop UI (via TMAN Designer), and reusable extensions such as real-time avatars, RAG (Retrieval-Augmented Generation), and image generation, TEN enables highly customizable, scalable agent development with minimal code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    ML.NET

    ML.NET

    Open source and cross-platform machine learning framework for .NET

    ...ML.NET lets you re-use all the knowledge, skills, code, and libraries you already have as a .NET developer so that you can easily integrate machine learning into your web, mobile, desktop, games, and IoT apps. ML.NET offers Model Builder (a simple UI tool) and ML.NET CLI to make it super easy to build custom ML Models. These tools use Automated ML (AutoML), a cutting edge technology that automates the process of building best performing models for your Machine Learning scenario. All you have to do is load your data, and AutoML takes care of the rest of the model building process. ML.NET has been designed as an extensible platform so that you can consume other popular ML frameworks (TensorFlow, ONNX, Infer.NET, and more) and have access to even more machine learning scenarios, like image classification, object detection, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    MediaPipe Solutions

    MediaPipe Solutions

    Cross-platform, customizable ML solutions

    ...The system provides developers with tools and reusable components that allow them to combine multiple machine learning models with preprocessing and postprocessing logic into efficient perception pipelines. These pipelines can run on a wide variety of platforms including mobile devices, desktop systems, web browsers, and embedded edge devices. MediaPipe is widely used in computer vision and multimedia applications such as hand tracking, face detection, pose estimation, object recognition, and gesture analysis. The framework includes prebuilt solutions that developers can quickly integrate into applications as well as lower-level APIs that allow custom pipeline construction.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Qwen-Image

    Qwen-Image

    Qwen-Image is a powerful image generation foundation model

    ...Qwen-Image supports sophisticated editing tasks such as style transfer, object insertion and removal, detail enhancement, and even human pose manipulation, making it suitable for both professional and casual users. It also includes advanced image understanding capabilities like object detection, semantic segmentation, depth and edge estimation, and novel view synthesis.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 8
    Betaflight

    Betaflight

    Open Source Flight Controller Firmware

    Betaflight is flight controller software (firmware) used to fly multi-rotor craft and fixed wing craft. This fork differs from Baseflight and Cleanflight in that it focuses on flight performance, leading-edge feature additions, and wide target support. Betaflight supports a wide range of flight controllers with at least an STM32F4 MCU. You can use nearly any piece of modern hardware out there. Full telemetry and high refresh rate for receivers, camera and VTX control, RGB LEDs, and much...
    Downloads: 348 This Week
    Last Update:
    See Project
  • 9
    Nixtla TimeGPT

    Nixtla TimeGPT

    TimeGPT-1: production ready pre-trained Time Series Foundation Model

    TimeGPT is a production ready, generative pretrained transformer for time series. It's capable of accurately predicting various domains such as retail, electricity, finance, and IoT with just a few lines of code. Whether you're a bank forecasting market trends or a startup predicting product demand, TimeGPT democratizes access to cutting-edge predictive insights, eliminating the need for a dedicated team of machine learning engineers. A generative model for time series. TimeGPT is capable of...
    Downloads: 2 This Week
    Last Update:
    See Project
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 10
    CyberScraper 2077

    CyberScraper 2077

    A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

    CyberScraper 2077 is not just another web scraping tool – it's a glimpse into the future of data extraction. Born from the neon-lit streets of a cyberpunk world, this AI-powered scraper uses OpenAI, Gemini and LocalLLM Models to slice through the web's defenses, extracting the data you need with unparalleled precision and style.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 11
    Bailing

    Bailing

    Bailing is a voice dialogue robot similar to GPT-4o

    ...It aims to be light enough to run without a GPU, making it usable on modest hardware or edge devices, while still maintaining low latency and smooth interaction. Bailing includes a memory system, giving the assistant the ability to remember user preferences and context across sessions, which enables more personalized and context-aware conversations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DebugSwift

    DebugSwift

    A toolkit to make debugging iOS applications easier

    DebugSwift is a comprehensive open-source toolkit aimed at iOS developers that streamlines and enhances the debugging experience for Swift-based applications by providing a rich set of interactive diagnostic tools. Designed to operate within the iOS environment (iOS 14+ and Swift 6+), it includes network and WebSocket inspectors for HTTP requests, detailed performance metrics (CPU, memory, FPS), and mechanisms to detect memory leaks and main thread violations in real time. The toolkit also...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Porcupine

    Porcupine

    On-device wake word detection powered by deep learning

    Build always-listening yet private voice applications. Porcupine is a highly-accurate and lightweight wake word engine. It enables building always-listening voice-enabled applications. It is using deep neural networks trained in real-world environments. Compact and computationally-efficient. It is perfect for IoT. Cross-platform. Arm Cortex-M, STM32, PSoC, Arduino, and i.MX RT. Raspberry Pi, NVIDIA Jetson Nano, and BeagleBone. Android and iOS. Chrome, Safari, Firefox, and Edge. Linux...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    dots.ocr

    dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    dots.ocr is a cutting-edge multilingual document parsing system built on a unified vision-language model that combines layout detection, text recognition, and structural understanding into a single architecture. Unlike traditional OCR pipelines that rely on multiple specialized components, dots.ocr integrates these processes end-to-end, reducing error propagation and improving consistency across tasks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Caire

    Caire

    Content aware image resize library

    Caire is a content aware image resize library based on Seam Carving for Content-Aware Image Resizing paper. An energy map (edge detection) is generated from the provided image. The algorithm tries to find the least important parts of the image taking into account the lowest energy values. Using a dynamic programming approach the algorithm will generate individual seams across the image from top to down, or from left to right (depending on the horizontal or vertical resizing) and will allocate for each seam a custom value, the least important pixels having the lowest energy cost and the most important ones having the highest cost. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Crypto-Pump-Bot-Ai-Powered

    Crypto-Pump-Bot-Ai-Powered

    A cutting-edge AI-driven cryptocurrency trading bot designed to detect

    A cutting-edge AI-driven cryptocurrency trading bot designed to detect and respond to market pump-and-dump activities. This bot leverages advanced machine learning models and sophisticated analysis techniques to identify opportunities, analyze market trends, and execute trades with precision.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    SyberiaOS
    Syberia Project (aka Syberia OS) is a custom ROM for many devices that implements various features. This project was designed by our "Syberia Team" from scratch based on AOSP source code with some CAF fixes and improvements. Our team: @blinoff82, @DennySPB, @alexxxdev. Also many thanks to our designer @rlshukhov for logos, wallpapers and bootanimation. If you like our work you can you can buy us a cup of coffee or a glass of beer here: https://syberiaos.com/links
    Downloads: 73 This Week
    Last Update:
    See Project
  • 18
    AudioEnhancerMAX

    AudioEnhancerMAX

    Open-source AI audio processing suite: 100% local, free, no limits.

    AudioEnhancerMAX is the open-source AI-powered audio media center for podcasters, creators, and professionals. It runs 100% locally on your hardware — no cloud, no subscriptions, no limits. Features: 16+ intelligent audio filters (AI noise removal, filler word detection, breath reduction, studio sound, audio super-resolution), Gemma 4 AI dynamic parameter tuning, distributed edge computing across Android smartphones, real-time system monitoring for Apple Silicon. Built with FastAPI, Whisper Large-v3, Pedalboard (Spotify), noisereduce, DeepFilterNet, and Gemma 4 via Ollama. Optimized for Apple Silicon M3 MAX. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    pinvault-pro-extension

    pinvault-pro-extension

    AI-powered browser extension for bulk downloading Pinterest images wit

    ...The extension provides intelligent bulk selection, auto-scroll functionality, and maintains HD quality downloads while respecting Pinterest's terms of service. 🎯 Core Features: • Visual checkbox overlays on Pinterest images for intuitive selection • AI-powered auto-scroll with smart content detection • Bulk download capabilities with real-time progress tracking • HD quality preservation in original resolution • Modern glassmorphism interface design • Privacy-focused architecture with no external servers • Cross-browser compatibility (Firefox, Chrome, Edge, Opera) • Real-time synchronization between popup and sidebar interfaces 🌍 Universal Browser Support: • Firefox: Official Mozilla Add-ons Store approval (61.4 KB package) • Chrome/Edge: Manifest V3 compatible (60.0 KB package) • Opera: Manifest V3 compatible (60.3 KB package)
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20

    hookprobe

    Free AI that blocks hackers while you sleep. Runs on cheap hardware

    HookProbe is an open-source AI-native intrusion detection system (IDS/IPS) that runs on Raspberry Pi and edge devices. It combines eBPF/XDP kernel-level packet filtering with machine learning threat classification to deliver autonomous network security with zero cloud dependency. The stack includes NAPSE (AI packet inspection), HYDRA (threat intelligence pipeline), SENTINEL (ML classification engine), and AEGIS (autonomous defense orchestrator).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Computer vision projects

    Computer vision projects

    computer vision projects | Fun AI projects related to computer vision

    Computer vision projects is an open-source collection of computer vision projects and experiments that demonstrate practical applications of modern AI techniques in image processing, robotics, and real-time visual analysis. The repository includes multiple demonstration systems implemented using languages such as Python and C++, covering topics ranging from object detection to embedded vision systems. Many of the projects illustrate how computer vision algorithms can interact with hardware platforms, including robotics systems and edge computing devices. The repository provides examples that combine machine learning models with real-world applications such as robotic arms, video analysis, and automated visual measurement systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    pineTERM: Free In-Browser UART Terminal

    pineTERM: Free In-Browser UART Terminal

    Easy-to-use, powerful web/browser UART terminal, no installation req.

    A modern, stand alone browser-based serial terminal for UART communication with microcontrollers, embedded devices, and IoT hardware. No installation required - just open in your browser and connect to your serial device. Just unpack the zip file into local folder and open index.html in Chrome/Edge/Opera. Works on Windows/Linux/MaOS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    FastViT

    FastViT

    This repository contains the official implementation of research

    FastViT is an efficient vision backbone family that blends convolutional inductive biases with transformer capacity to deliver strong accuracy at mobile and real-time inference budgets. Its design pursues a favorable latency-accuracy Pareto curve, targeting edge devices and server scenarios where throughput and tail latency matter. The models use lightweight attention and carefully engineered blocks to minimize token mixing costs while preserving representation power. Training and inference recipes highlight straightforward integration into common vision tasks such as classification, detection, and segmentation. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    ImageJ2x

    Java Image Processing Program

    ...It can measure distances and angles. It can record density histograms and line profiles. It supports standard image processing functions such as contrast manipulation, sharpening, smoothing, edge detection and filtering it through all kinds of geometric transformations such as Zoom in / out and rotation. Images can be enlarged up to 32
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    BEVFormer

    BEVFormer

    Implementation of BEVFormer, a camera-only framework

    3D visual perception tasks, including 3D detection and map segmentation based on multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, which learns unified BEV representations with spatiotemporal transformers to support multiple autonomous driving perception tasks. In a nutshell, BEVFormer exploits both spatial and temporal information by interacting with spatial and temporal space through predefined grid-shaped BEV...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next
MongoDB Logo MongoDB