Sharp Monocular Metric Depth in Less Than a Second
Recovering the Visual Space from Any Views
A theoretical reconstruction of the Claude Mythos architecture
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Stable Diffusion built-in to Blender
ImageBind One Embedding Space to Bind Them All
[CVPR 2025 Best Paper Award] VGGT
Photorealistic Synthetic Dataset for Holistic Indoor Scene
Curated list of classic, high-quality computer science books
Qwen-Image is a powerful image generation foundation model
3D reconstruction software
Improve your Baduk skills by training with KataGo
RGBD video generation model conditioned on camera input
Drop-in replacement for standard residual connections in Transformers
Diffusion Transformer with Fine-Grained Chinese Understanding
AI video generator optimized for low VRAM and older GPUs use
An AI-powered research assistant that performs iterative research
Crawl a site to generate knowledge files to create your own custom GPT
Tooling for the Common Objects In 3D dataset
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
Fast and Universal 3D reconstruction model for versatile tasks
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Functional Machine Learning
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Perplexica is an AI-powered answering engine.