Implementation of Make-A-Video, new SOTA text to video generator
PyTorch code and models for VJEPA2 self-supervised learning from video
Interactive video and image annotation tool for computer vision
The data structure for multimodal data
Deep Learning API and Server in C++14 support for Caffe, PyTorch
A GPU-accelerated library containing highly optimized building blocks
PyTorch code and models for V-JEPA self-supervised learning from video
A distributed system for embedding-based vector retrieval
Build cross-modal and multimodal applications on the cloud
Open Source Computer Vision Library
A computer vision framework to create and deploy apps in minutes
Face Mask Detection system based on computer vision and deep learning
Gluon CV Toolkit
Deep Learning (Flower Book) mathematical derivation
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
World's simplest facial recognition api for Python & the command line
A curated list of resources dedicated to RNN