Implementation of Nougat Neural Optical Understanding
Visual localization made easy with hloc
Task-oriented finetuning for better embeddings on neural search
Implementation of BEVFormer, a camera-only framework
Visual analysis and diagnostic tools to facilitate ML selection
Code release for ConvNeXt model
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Generative Adversarial Transformers
Implementation of Deep Feature Rotation for Multimodal Image
PyTorch implementation of MoCo v3
All-in-one web-based IDE specialized for machine learning
A real-time approach for mapping all human pixels of 2D RGB images
PyTorch implementation of SimCLR: A Simple Framework
Constantly summarizing open source dataset and critical papers
We estimate dense, flicker-free, geometrically consistent depth
Visual tracking library based on PyTorch
Compute FID scores with PyTorch
A starter agent that can solve a number of universe environments
Cross Audio-Visual Recognition using 3D Architectures
Aims to enable researcher to tap in to mobile computing capability
Scripthea is designed to streamline of crafting prompts for T2I gen.