A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Implementation of Make-A-Video, new SOTA text to video generator
PyTorch code and models for VJEPA2 self-supervised learning from video
Build cross-modal and multimodal applications on the cloud
PyTorch3D is FAIR's library of reusable components for deep learning
The data structure for multimodal data
Pre-trained Deep Learning models and demos
The Triton Inference Server provides an optimized cloud
Deep learning optimization library making distributed training easy
Making large AI models cheaper, faster and more accessible
PyTorch code and models for V-JEPA self-supervised learning from video
A Python library for audio data augmentation
MII makes low-latency and high-throughput inference possible
Open Source Computer Vision Library
A computer vision framework to create and deploy apps in minutes
Distributed training framework for TensorFlow, Keras, PyTorch, etc.
A flexible and efficient library for deep learning
Face Mask Detection system based on computer vision and deep learning
Gluon CV Toolkit
We estimate dense, flicker-free, geometrically consistent depth
Library of deep learning models and datasets
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Deep Learning (Flower Book) mathematical derivation
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Deep learning person re-identification in PyTorch