video image extractor free download

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitle (hardsub) from videos

Video hard subtitle extraction, generate srt file. There is no need to apply for a third-party API, and text recognition can be implemented locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. Use local OCR recognition, no need to set up and call any API, and do not need to access online OCR services such as Baidu...

1 Review

Downloads: 43 This Week

Last Update: 2026-04-05

See Project

Make-A-Video - Pytorch (wip)

Implementation of Make-A-Video, new SOTA text to video generator

...The gist of the paper comes down to, take a SOTA text-to-image model (here they use DALL-E2, but the same learning points would easily apply to Imagen), make a few minor modifications for attention across time and other ways to skimp on the compute cost, do frame interpolation correctly, get a great video model out. Passing in images (if one were to pretrain on images first), both temporal convolution and attention will be automatically skipped.

Downloads: 1 This Week

Last Update: 2024-05-03

See Project

Computer Vision Annotation Tool (CVAT)

Interactive video and image annotation tool for computer vision

Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for...

Downloads: 26 This Week

Last Update: 2026-04-30

See Project

DeepDetect

Deep Learning API and Server in C++14 support for Caffe, PyTorch

...While the Open Source Deep Learning Server is the core element, with REST API, and multi-platform support that allows training & inference everywhere, the Deep Learning Platform allows higher level management for training neural network models and using them as if they were simple code snippets. Ready for applications of image tagging, object detection, segmentation, OCR, Audio, Video, Text classification, CSV for tabular data and time series. Neural network templates for the most effective architectures for GPU, CPU, and Embedded devices. Training in a few hours and with small data thanks to 25+ pre-trained models. Full Open Source, with an ecosystem of tools (API clients, video, annotation, ...) ...

Downloads: 1 This Week

Last Update: 2025-07-19

See Project

DocArray

The data structure for multimodal data

DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. ...

Downloads: 0 This Week

Last Update: 2025-03-21

See Project

DALI

A GPU-accelerated library containing highly optimized building blocks

The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding, cropping, resizing, and many other augmentations. These data processing pipelines, which are currently executed on the CPU, have become a bottleneck, limiting the performance and scalability of training and inference. ...

Downloads: 1 This Week

Last Update: 2026-04-16

See Project

JEPA

PyTorch code and models for V-JEPA self-supervised learning from video

JEPA (Joint-Embedding Predictive Architecture) captures the idea of predicting missing high-level representations rather than reconstructing pixels, aiming for robust, scalable self-supervised learning. A context encoder ingests visible regions and predicts target embeddings for masked regions produced by a separate target encoder, avoiding low-level reconstruction losses that can overfit to texture. This makes learning focus on semantics and structure, yielding features that transfer well...

Downloads: 0 This Week

Last Update: 2025-10-07

See Project

Vearch

A distributed system for embedding-based vector retrieval

...End-to-end one-click deployment. Through the module of the plugin, a complete default visual search system can be deployed just with one click. Otherwise, you can easily customize your own image, video, or text feature extraction algorithm plugin. This GIF provides a clear demonstration of the project vearch usage and its internal structure. The use of vearch is mainly divided into three steps. Firstly, create DB and Space, then import your data, and finally, you can search on your own dataset.

Downloads: 0 This Week

Last Update: 2026-02-04

See Project

Jina

Build cross-modal and multimodal applications on the cloud

...Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP, GraphQL protocols with TLS. Intuitive design pattern for high-performance microservices. Seamless Docker container integration: sharing, exploring, sandboxing, versioning and dependency control via Jina Hub. Fast deployment to Kubernetes, Docker Compose and Jina Cloud. ...

Downloads: 0 This Week

Last Update: 2024-11-12

See Project

Gluon CV Toolkit

...It features training scripts that reproduce SOTA results reported in latest papers, a large set of pre-trained models, carefully designed APIs and easy-to-understand implementations and community support. From fundamental image classification, object detection, semantic segmentation and pose estimation, to instance segmentation and video action recognition. The model zoo is the one-stop shopping center for many models you are expecting. GluonCV embraces a flexible development pattern while is super easy to optimize and deploy without retaining a heavyweight deep learning framework.

Downloads: 0 This Week

Last Update: 2021-11-01

See Project

Torchreid

Deep learning person re-identification in PyTorch

Torchreid is a library for deep-learning person re-identification, written in PyTorch and developed for our ICCV’19 project, Omni-Scale Feature Learning for Person Re-Identification. In "deep-person-reid/scripts/", we provide a unified interface to train and test a model. See "scripts/main.py" and "scripts/default_config.py" for more details. The folder "configs/" contains some predefined configs which you can use as a starting point. The code will automatically (download and) load the...

Downloads: 1 This Week

Last Update: 2022-08-05

See Project

Awesome Recurrent Neural Networks

A curated list of resources dedicated to RNN

A curated list of resources dedicated to recurrent neural networks (closely related to deep learning). Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (Tensorflow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn...

Downloads: 0 This Week

Last Update: 2021-09-22

See Project

Search Results for "video image extractor"

Showing 12 open source projects for "video image extractor"

Video-subtitle-extractor

Make-A-Video - Pytorch (wip)

Computer Vision Annotation Tool (CVAT)

DeepDetect

DocArray

DALI

JEPA

Vearch

Jina

Gluon CV Toolkit

Torchreid

Awesome Recurrent Neural Networks

Search Results for "video image extractor"

Showing 12 open source projects for "video image extractor"

Video-subtitle-extractor

Make-A-Video - Pytorch (wip)

Computer Vision Annotation Tool (CVAT)

DeepDetect

DocArray

DALI

JEPA

Vearch

Jina

Gluon CV Toolkit

Torchreid

Awesome Recurrent Neural Networks

Related Searches

Related Categories