Sharp Monocular Metric Depth in Less Than a Second
AI tool converting video/audio into structured documents instantly
Recovering the Visual Space from Any Views
Open source drag-and-drop reporting and dashboard builder platform
Use your most capable model to audit your codebase
A smart, powerful, and beautiful excalidraw drawing tool
Synthesizing and manipulating 2048x1024 images with conditional GANs
Roadmap to becoming an Artificial Intelligence Expert in 2022
Multimodal embedding and reranking models built on Qwen3-VL
Simple and easily configurable grid world environments
A Unified Framework for Text-to-3D and Image-to-3D Generation
A Protocol for Agent-Driven Interfaces
A tension reasoning engine over 131 S-class problems
Gateway service that instantly transforms existing MCP Servers
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A timeline of the latest AI models for audio generation
Generate 3D objects conditioned on text or images
Let us control diffusion models
High-Resolution 3D Human Digitization from A Single Image
Class Activation Mapping
Conditional Variational Autoencoder with Adversarial Learning
A real-time approach for mapping all human pixels of 2D RGB images
Adversarial Latent Autoencoders
Pytorch implementation of our method for high-resolution
Code for "Image Generation from Scene Graphs", Johnson et al, CVPR 201