video annotation free download

Showing 36 open source projects for "video annotation"

View related business solutions

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Forever Free Full-Stack Observability | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
1

labelme Image Polygonal Annotation

Image polygonal annotation with Python

Labelme is a graphical image annotation tool. It is written in Python and uses Qt for its graphical interface. Image annotation for polygon, rectangle, circle, line and point. Image flag annotation for classification and cleaning. Video annotation. (video annotation). GUI customization (predefined labels / flags, auto-saving, label validation, etc). Exporting VOC-format dataset for semantic/instance segmentation.

Downloads: 27 This Week

Last Update: 23 hours ago
See Project
2

Computer Vision Annotation Tool (CVAT)

Interactive video and image annotation tool for computer vision

Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for...

Downloads: 34 This Week

Last Update: 2 days ago
See Project
3

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything

X-AnyLabeling is an open-source data annotation platform designed to streamline the process of labeling datasets for computer vision and multimodal AI applications. The software integrates an AI-powered labeling engine that allows users to generate annotations automatically with the assistance of modern vision models such as Segment Anything and various object detection frameworks. It supports labeling tasks across images and videos and enables developers to prepare training datasets for...

Downloads: 36 This Week

Last Update: 5 days ago
See Project
4

Label Studio

Label Studio is a multi-type data labeling and annotation tool

...It can be used to prepare raw data or improve existing training data to get more accurate ML models. The frontend part of Label Studio app lies in the frontend/ folder and written in React JSX. Multi-user labeling sign up and login, when you create an annotation it's tied to your account. Configurable label formats let you customize the visual interface to meet your specific labeling needs. Support for multiple data types including images, audio, text, HTML, time-series, and video.

Downloads: 16 This Week

Last Update: 2026-03-13
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
5

Director

AI video agents framework for next-gen video interactions

Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.

Downloads: 0 This Week

Last Update: 2025-01-29
See Project
6

HunyuanWorld-Voyager

RGBD video generation model conditioned on camera input

HunyuanWorld-Voyager is a next-generation video diffusion framework developed by Tencent-Hunyuan for generating world-consistent 3D scene videos from a single input image. By leveraging user-defined camera paths, it enables immersive scene exploration and supports controllable video synthesis with high realism. The system jointly produces aligned RGB and depth video sequences, making it directly applicable to 3D reconstruction tasks. At its core, Voyager integrates a world-consistent video...

Downloads: 19 This Week

Last Update: 2026-04-15
See Project
7

Ovmeet

Video conferencing and collaboration platform

OvMeet is a video conferencing and collaboration platform developed in China that supports video meetings and H5 web/video live streaming. WebRTC, RTMP, SIP, RTSP, whiteboards, document presentation, file sharing, desktop sharing, recording, and more. The older version was built using Adobe/Flash, but that is no longer maintained. The newer version uses modern web technologies to deliver video conferencing services across Web, H5, Android, iOS, PC, etc. It also supports server...

Downloads: 2 This Week

Last Update: 2026-03-26
See Project
8

Screenity

The most powerful screen recorder & annotation tool for Chrome

Screenity is a feature-packed screen and camera recorder for Chrome. Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show...

1 Review

Downloads: 15 This Week

Last Update: 1 day ago
See Project
9

Scriberr

Self-hosted AI audio transcription

Scriberr is a self-hosted AI-powered transcription platform designed to convert audio and video into highly accurate text while prioritizing privacy and local processing. Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts...

Downloads: 3 This Week

Last Update: 2026-03-19
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
10

DeepDetect

Deep Learning API and Server in C++14 support for Caffe, PyTorch

...While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) Fast Server written in pure C++, a single codebase for Cloud, Desktop & Embedded.

Downloads: 1 This Week

Last Update: 2025-07-19
See Project
11

AudioNotes

Extract audio and video content and organize it into a Markdown note

AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. As an...

Downloads: 0 This Week

Last Update: 2025-12-04
See Project
12

SAHI

A lightweight vision library for performing large object detection

A lightweight vision library for performing large-scale object detection & instance segmentation. Object detection and instance segmentation are by far the most important fields of applications in Computer Vision. However, detection of small objects and inference on large images are still major issues in practical usage. Here comes the SAHI to help developers overcome these real-world problems with many vision utilities. Detection of small objects and objects far away in the scene is a major...

Downloads: 0 This Week

Last Update: 2025-09-28
See Project
13

mrv2

mrv2 - Professional player and review tool for VFX, animation and CGI

mrv2 is the second generation of the popular review tool mrViewer. It is faster, adds python support, networking with path mappings, etc.

Downloads: 74 This Week

Last Update: 2026-04-16
See Project
14

DataGym.ai

Open source annotation and labeling tool for image and video assets

DATAGYM enables data scientists and machine learning experts to label images up to 10x faster. AI-assisted annotation tools reduce manual labeling effort, give you more time to finetune ML models and speed up your go to market of new products. Accelerate your computer vision projects by cutting down data preparation time up to 50%. A machine learning model is only as good as its training data. DATAGYM is an end-to-end workbench to create, annotate, manage, and export the right training data...

Downloads: 0 This Week

Last Update: 2023-06-01
See Project
15

Free Screen Recorder for Windows 10

Effortlessly record and capture pc screen with this screen recorder

Many people use screen recording to produce tutorials, record gameplay, or capture a video call. There are numerous solutions available, each with its features and benefits. And you're in luck if you're browsing for a free and trustworthy screen recorder for Windows 10 because this software is one of Windows's best free screen recorders. Indeed, professional streamers, YouTubers, and gamers worldwide use this powerful software. It allows you to seamlessly record your screen, webcam, and...

Downloads: 7 This Week

Last Update: 2023-04-03
See Project
16

DensePose

A real-time approach for mapping all human pixels of 2D RGB images

DensePose is a computer vision system that maps all human pixels in an RGB image to the 3D surface of a human body model. It extends human pose estimation from predicting joint keypoints to providing dense correspondences between 2D images and a canonical 3D mesh (such as the SMPL model). This enables detailed understanding of human shape, motion, and surface appearance directly from images or videos. The repository includes the DensePose network architecture, training code, pretrained...

Downloads: 8 This Week

Last Update: 2025-10-06
See Project
17

VoTT

Visual Object Tagging Tool, an electron app for building models

Visual Object Tagging Tool: An electron app for building end-to-end Object Detection Models from Images and Videos. An open source annotation and labeling tool for image and video assets. VoTT is a React + Redux Web application, written in TypeScript. This project was bootstrapped with Create React App. VoTT can be installed as a native application or run from source. VoTT is also available as a stand-alone Web application and can be used in any modern Web browser. VoTT is available for Windows, Linux and OSX. ...

1 Review

Downloads: 6 This Week

Last Update: 2022-08-02
See Project
18

Video Annotation and Reference System

The Video Annotation and Reference System (VARS) is a software interface and database system that provides tools for describing, cataloging, retrieving, and viewing the visual, descriptive, and quantitative data associated with video.

1 Review

Downloads: 2 This Week

Last Update: 2019-11-19
See Project
19

Vianto

Video Annotation Tool

See https://sourceforge.net/p/vianto/wiki/ for version features and feature requests. Vianto is a Java-based video annotation / coding tool with graphical user interface that allows you to: - Record video (in OSX only) - Save and load markers to code the video with (timestamps automatically generated for events) - Double click on events and the video will jump to the right place in the video - Click a marker to select start time, click again to set end time of code or preset a plus/minus time (in seconds) - Wildcard code to input free text - Compare multiple codings and create a consolidated set of events - Link multiple videos together to view multiple angles shot from different cameras at the same time Built with VLCj and packaged to run on Windows and OSX without the need to install VLC. ...

Downloads: 0 This Week

Last Update: 2016-11-05
See Project
20

Sensory Effect Video Annotation

Annotation tool for sensory effects and multimedia content

This program provides means for annotating multimedia content (e.g., videos) with sensory effects based on the MPEG-V: Media Context and Control standard, especially Part 3: Sensory Information (ISO/IEC 23005-3). Now with integrated Sensory Effect Simulator (SESim).

Downloads: 0 This Week

Last Update: 2014-11-26
See Project
21

Sensory Effect Simulator

Simulator for sensory effect and multimedia content

This program provides means for simulating multimedia content (e.g., videos) with sensory effects based on the MPEG-V: Media Context and Control standard, especially Part 3: Sensory Information (ISO/IEC 23005-3). NOTE: SESim will not be developed any further as it is now integrated in the Sensory Effect Video Annotation (SEVino) tool: https://sourceforge.net/projects/sevino/

Downloads: 0 This Week

Last Update: 2014-11-26
See Project
22

GenomeRunner

Annotation and enrichment of Next-Gen sequencing data

Note: This version requires additional SQLite database files. Contact the developers to obtain them. Use http://www.integrativegenomics.org/ for the latest data and analyses. GenomeRunner is a tool for automating genome exploration. It performs annotation and enrichment analyses of user-provided genomic regions (SNPs, ChIP-seq binding sites etc.) against >6,000 (human genome) epigenomic features available from the UCSC genome browser. Input - any genome-wide data data in .bed...

1 Review

Downloads: 0 This Week

Last Update: 2016-10-27
See Project
23

Unified Video Page Ranking

This project is dedicated for semantic annotation of video and image files and development of a unified page ranking for them.

Downloads: 0 This Week

Last Update: 2016-07-30
See Project
24

MacEval

MacEval is part of an Evaluation Framework Suite to support usability evaluators with means of performing low-cost usability evaluations. The tool allows the evaluator to record, evaluate, and analyze data user tests.

Downloads: 0 This Week

Last Update: 2013-04-22
See Project
25

VidL

VidL is an online video and image annotation program that allows for the management of workers and data via a webserver. Annotate videos and images with meta data or other time related markers. Use include research data collection and annotation, video i

Downloads: 1 This Week

Last Update: 2013-04-24
See Project