Showing 36 open source projects for "video annotation"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 1
    labelme Image Polygonal Annotation

    labelme Image Polygonal Annotation

    Image polygonal annotation with Python

    Labelme is a graphical image annotation tool. It is written in Python and uses Qt for its graphical interface. Image annotation for polygon, rectangle, circle, line and point. Image flag annotation for classification and cleaning. Video annotation. (video annotation). GUI customization (predefined labels / flags, auto-saving, label validation, etc). Exporting VOC-format dataset for semantic/instance segmentation.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    Computer Vision Annotation Tool (CVAT)

    Computer Vision Annotation Tool (CVAT)

    Interactive video and image annotation tool for computer vision

    Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for...
    Downloads: 34 This Week
    Last Update:
    See Project
  • 3
    X-AnyLabeling

    X-AnyLabeling

    Effortless data labeling with AI support from Segment Anything

    X-AnyLabeling is an open-source data annotation platform designed to streamline the process of labeling datasets for computer vision and multimodal AI applications. The software integrates an AI-powered labeling engine that allows users to generate annotations automatically with the assistance of modern vision models such as Segment Anything and various object detection frameworks. It supports labeling tasks across images and videos and enables developers to prepare training datasets for...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 4
    Label Studio

    Label Studio

    Label Studio is a multi-type data labeling and annotation tool

    ...It can be used to prepare raw data or improve existing training data to get more accurate ML models. The frontend part of Label Studio app lies in the frontend/ folder and written in React JSX. Multi-user labeling sign up and login, when you create an annotation it's tied to your account. Configurable label formats let you customize the visual interface to meet your specific labeling needs. Support for multiple data types including images, audio, text, HTML, time-series, and video.
    Downloads: 24 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    Director

    Director

    AI video agents framework for next-gen video interactions

    Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    HunyuanWorld-Voyager

    HunyuanWorld-Voyager

    RGBD video generation model conditioned on camera input

    HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 7
    Ovmeet

    Ovmeet

    Video conferencing and collaboration platform

    OvMeet is a video conferencing and collaboration platform developed in China that supports video meetings and H5 web/video live streaming. WebRTC, RTMP, SIP, RTSP, whiteboards, document presentation, file sharing, desktop sharing, recording, and more. The older version was built using Adobe/Flash, but that is no longer maintained. The newer version uses modern web technologies to deliver video conferencing services across Web, H5, Android, iOS, PC, etc. It also supports server...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Screenity

    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    Screenity is a feature-packed screen and camera recorder for Chrome. Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 9
    DeepDetect

    DeepDetect

    Deep Learning API and Server in C++14 support for Caffe, PyTorch

    ...While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) Fast Server written in pure C++, a single codebase for Cloud, Desktop & Embedded.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    Scriberr

    Scriberr

    Self-hosted AI audio transcription

    Scriberr is a self-hosted AI-powered transcription platform designed to convert audio and video into highly accurate text while prioritizing privacy and local processing. Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SAHI

    SAHI

    A lightweight vision library for performing large object detection

    A lightweight vision library for performing large-scale object detection & instance segmentation. Object detection and instance segmentation are by far the most important fields of applications in Computer Vision. However, detection of small objects and inference on large images are still major issues in practical usage. Here comes the SAHI to help developers overcome these real-world problems with many vision utilities. Detection of small objects and objects far away in the scene is a major...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    mrv2

    mrv2

    mrv2 - Professional player and review tool for VFX, animation and CGI

    mrv2 is the second generation of the popular review tool mrViewer. It is faster, adds python support, networking with path mappings, etc.
    Leader badge
    Downloads: 71 This Week
    Last Update:
    See Project
  • 14
    DataGym.ai

    DataGym.ai

    Open source annotation and labeling tool for image and video assets

    DATAGYM enables data scientists and machine learning experts to label images up to 10x faster. AI-assisted annotation tools reduce manual labeling effort, give you more time to finetune ML models and speed up your go to market of new products. Accelerate your computer vision projects by cutting down data preparation time up to 50%. A machine learning model is only as good as its training data. DATAGYM is an end-to-end workbench to create, annotate, manage, and export the right training data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Free Screen Recorder for Windows 10

    Effortlessly record and capture pc screen with this screen recorder

    Many people use screen recording to produce tutorials, record gameplay, or capture a video call. There are numerous solutions available, each with its features and benefits. And you're in luck if you're browsing for a free and trustworthy screen recorder for Windows 10 because this software is one of Windows's best free screen recorders. Indeed, professional streamers, YouTubers, and gamers worldwide use this powerful software. It allows you to seamlessly record your screen, webcam, and...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    DensePose

    DensePose

    A real-time approach for mapping all human pixels of 2D RGB images

    DensePose is a computer vision system that maps all human pixels in an RGB image to the 3D surface of a human body model. It extends human pose estimation from predicting joint keypoints to providing dense correspondences between 2D images and a canonical 3D mesh (such as the SMPL model). This enables detailed understanding of human shape, motion, and surface appearance directly from images or videos. The repository includes the DensePose network architecture, training code, pretrained...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    VoTT

    VoTT

    Visual Object Tagging Tool, an electron app for building models

    Visual Object Tagging Tool: An electron app for building end-to-end Object Detection Models from Images and Videos. An open source annotation and labeling tool for image and video assets. VoTT is a React + Redux Web application, written in TypeScript. This project was bootstrapped with Create React App. VoTT can be installed as a native application or run from source. VoTT is also available as a stand-alone Web application and can be used in any modern Web browser. VoTT is available for Windows, Linux and OSX. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    The Video Annotation and Reference System (VARS) is a software interface and database system that provides tools for describing, cataloging, retrieving, and viewing the visual, descriptive, and quantitative data associated with video.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Vianto

    Vianto

    Video Annotation Tool

    See https://sourceforge.net/p/vianto/wiki/ for version features and feature requests. Vianto is a Java-based video annotation / coding tool with graphical user interface that allows you to: - Record video (in OSX only) - Save and load markers to code the video with (timestamps automatically generated for events) - Double click on events and the video will jump to the right place in the video - Click a marker to select start time, click again to set end time of code or preset a plus/minus time (in seconds) - Wildcard code to input free text - Compare multiple codings and create a consolidated set of events - Link multiple videos together to view multiple angles shot from different cameras at the same time Built with VLCj and packaged to run on Windows and OSX without the need to install VLC. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Sensory Effect Video Annotation

    Annotation tool for sensory effects and multimedia content

    This program provides means for annotating multimedia content (e.g., videos) with sensory effects based on the MPEG-V: Media Context and Control standard, especially Part 3: Sensory Information (ISO/IEC 23005-3). Now with integrated Sensory Effect Simulator (SESim).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Sensory Effect Simulator

    Simulator for sensory effect and multimedia content

    This program provides means for simulating multimedia content (e.g., videos) with sensory effects based on the MPEG-V: Media Context and Control standard, especially Part 3: Sensory Information (ISO/IEC 23005-3). NOTE: SESim will not be developed any further as it is now integrated in the Sensory Effect Video Annotation (SEVino) tool: https://sourceforge.net/projects/sevino/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    GenomeRunner

    GenomeRunner

    Annotation and enrichment of Next-Gen sequencing data

    Note: This version requires additional SQLite database files. Contact the developers to obtain them. Use http://www.integrativegenomics.org/ for the latest data and analyses. GenomeRunner is a tool for automating genome exploration. It performs annotation and enrichment analyses of user-provided genomic regions (SNPs, ChIP-seq binding sites etc.) against >6,000 (human genome) epigenomic features available from the UCSC genome browser. Input - any genome-wide data data in .bed...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project is dedicated for semantic annotation of video and image files and development of a unified page ranking for them.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MacEval
    MacEval is part of an Evaluation Framework Suite to support usability evaluators with means of performing low-cost usability evaluations. The tool allows the evaluator to record, evaluate, and analyze data user tests.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    VidL is an online video and image annotation program that allows for the management of workers and data via a webserver. Annotate videos and images with meta data or other time related markers. Use include research data collection and annotation, video i
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB