VGGSfM: Visual Geometry Grounded Deep Structure From Motion
ICLR2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
Motion-controllable Video Generation via Latent Trajectory Guidance
PaddlePaddle End-to-End Development Toolkit
Modular quant framework
Gemma open-weight LLM library, from Google DeepMind
Flexible Photo Recrafting While Preserving Your Identity
Large-language-model & vision-language-model based on Linear Attention
Learning multi-scale deep model correcting over- and under- exposed
Implementation of BEVFormer, a camera-only framework
PyTorch implementation of MAE
Tool for building chat bots, apps and custom integrations
PyTorch implementation of MoCo v3
Tool for building conversation applications
PyTorch implementation of SimCLR: A Simple Framework
Visual tracking library based on PyTorch
High-fidelity indoor 3D dataset for AI simulation and robotics
Training a Correlation Filter end-to-end allows lightweight networks
Scientific computing, machine learning and computer vision for .NET