Implementation of Make-A-Video, a new SOTA text-to-video generator
PyTorch code and models for VJEPA2 self-supervised learning from video
Interactive video and image annotation tool for computer vision
C++ library for high-performance inference on NVIDIA GPUs
The data structure for multimodal data
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
A GPU-accelerated library containing highly optimized building blocks
MNN is a blazing fast, lightweight deep learning framework
PyTorch code and models for V-JEPA self-supervised learning from video
A distributed system for embedding-based vector retrieval
Build cross-modal and multimodal applications on the cloud
Open Source Computer Vision Library
A computer vision framework to create and deploy apps in minutes
Face Mask Detection system based on computer vision and deep learning
Gluon CV Toolkit
We estimate dense, flicker-free, geometrically consistent depth
Best Practices, code samples, and documentation for Computer Vision
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Deep Learning (Flower Book) mathematical derivation
World's simplest facial recognition API for Python & the command line
A curated list of resources dedicated to RNN