Implementation of Vision Transformer, a simple way to achieve SOTA
ICLR2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Codebase for Image Classification Research, written in PyTorch
A modular framework for vision & language multimodal research
Fast, modular reference implementation of Instance Segmentation
A curated list of resources dedicated to RNN
Computer vision and image processing library for Qt.