Implementation of Vision Transformer, a simple way to achieve SOTA
Official DeiT repository
ICLR2024 Spotlight: curation/training code, metadata, distribution
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Reference PyTorch implementation and models for DINOv3
[CVPR 2025 Best Paper Award] VGGT
PyTorch code and models for the DINOv2 self-supervised learning
Contexts Optical Compression
Benchmarking Multimodal Agents for Open-Ended Tasks
Foundational Models for State-of-the-Art Speech and Text Translation
fast C++ library for linear algebra & scientific computing
High-Resolution 3D Human Digitization from A Single Image
Resources to learn computer science in your spare time
PyTorch implementation of MAE
Codebase for Image Classification Research, written in PyTorch
Efficient 3D human pose estimation in video using 2D keypoint
Fast, modular reference implementation of Instance Segmentation
Matlab codes for feature learning
**MOVED TO GITHUB** ==> https://github.com/MRPT/mrpt
Efficient Approximate Nearest Neighbors for General Metric Spaces
Analysis of undersea fish videos
Computer vision and image processing library for Qt.