A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Implementation of Make-A-Video, new SOTA text to video generator
PyTorch code and models for VJEPA2 self-supervised learning from video
Build cross-modal and multimodal applications on the cloud
Interactive video and image annotation tool for computer vision
C++ library for high performance inference on NVIDIA GPUs
PyTorch3D is FAIR's library of reusable components for deep learning
Techniques for deep learning with satellite & aerial imagery
The data structure for multimodal data
Deep Learning API and Server in C++14 support for Caffe, PyTorch
MNN is a blazing fast, lightweight deep learning framework
Pre-trained Deep Learning models and demos
A GPU-accelerated library containing highly optimized building blocks
The Triton Inference Server provides an optimized cloud
Deep learning at the speed of light
Deep learning optimization library making distributed training easy
Making large AI models cheaper, faster and more accessible
PyTorch code and models for V-JEPA self-supervised learning from video
Toolkit for making machine learning and data analysis applications
A Python library for audio data augmentation
MII makes low-latency and high-throughput inference possible
A distributed system for embedding-based vector retrieval
Open Source Computer Vision Library
CometAnalyser, for quantitative comet assay analysis.
A computer vision framework to create and deploy apps in minutes