An extensive node suite that enables ComfyUI to process 3D inputs
Generating Immersive, Explorable, and Interactive 3D Worlds
3D Computer Vision Framework
OpenClaw Office is the visual monitoring and management frontend
Recovering the Visual Space from Any Views
A lightweight 3D Morphable Face Model library in modern C++
PyTorch3D is FAIR's library of reusable components for deep learning
Claw3D is an open source 3D engine built on OpenClaw
Unifying 3D Mesh Generation with Language Models
Crafting engine for artists, designers, and filmmakers
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Benchmarking Multimodal Agents for Open-Ended Tasks
SAPIEN Manipulation Skill Framework
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Workflow and speech recognition app
Doom-based AI research platform for reinforcement learning
Python inference and LoRA trainer package for the LTX-2 audio–video
End-to-end pipeline converting generative videos
PaddlePaddle End-to-End Development Toolkit
From Addition, Subtraction, Multiplication, and Division to ML
Modular quant framework
Visual localization made easy with hloc
A library for graph deep learning research