DeepMind model for tracking arbitrary points across videos & robotics
Uncommon Objects in 3D dataset
AIMET is a library that provides advanced quantization and compression
Image polygonal annotation with Python
Agent Zero AI framework
Official implementation of DreamCraft3D
PyTorch3D is FAIR's library of reusable components for deep learning
code for Mesh R-CNN, ICCV 2019
Fast and Universal 3D reconstruction model for versatile tasks
Motion-controllable Video Generation via Latent Trajectory Guidance
Pretrained time-series foundation model developed by Google Research
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Train machine learning models within Docker containers
Build resilient language agents as graphs
Open source personal AI Assistant for Linux, Windows and Mac
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
HexStrike AI MCP Agents is an advanced MCP server
21 Lessons, Get Started Building with Generative AI
Sample code and notebooks for Generative AI on Google Cloud
OpenDAN is an open source Personal AI OS
Utilities intended for use with Llama models
A template for development with the open-autonomy framework
Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.
Qwen2.5-VL is the multimodal large language model series