Structure-from-Motion and Multi-View Stereo
Tooling for the Common Objects In 3D dataset
DeepMind model for tracking arbitrary points across videos & robotics
Uncommon Objects in 3D dataset
Official implementation of DreamCraft3D
Image polygonal annotation with Python
An open source implementation of CLIP
Data Lake for Deep Learning. Build, manage, and query datasets
An unsupervised and free tool for image and video dataset analysis
Provides code for running inference with the SegmentAnything Model
Sample code and notebooks for Generative AI on Google Cloud
Modern columnar data format for ML and LLMs implemented in Rust
Qwen2.5-VL is the multimodal large language model series
Global weather forecasting model using graph neural networks and JAX
Standalone, small, language-neutral
Pretrained time-series foundation model developed by Google Research
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
code for Mesh R-CNN, ICCV 2019
Query and Summarize your chat messages
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
A self-hostable CDN for databases
Unified web UI for training and running open models locally
Application implementation with business use cases
MCP Aggregator, Orchestrator, Middleware, Gateway in one docker
VGGSfM: Visual Geometry Grounded Deep Structure From Motion