MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Language modeling in a sentence representation space
PyTorch code and models for DINOv2 self-supervised learning
An AI-powered security review GitHub Action using Claude
Example Discord bot written in Python that uses the completions API
Programmatic access to the AlphaGenome model
A Unified Framework for Text-to-3D and Image-to-3D Generation
Chat & pretrained large vision language model
Designed for text embedding and ranking tasks
AlphaFold 3 inference pipeline
Implementation of "MobileCLIP", CVPR 2024
4M: Massively Multimodal Masked Modeling
Sharp Monocular Metric Depth in Less Than a Second
This repository contains the official implementation of FastVLM
Code for Mesh R-CNN, ICCV 2019
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Uncommon Objects in 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
A Production-ready Reinforcement Learning AI Agent Library
Video understanding codebase from FAIR for reproducing video models
Hackable and optimized Transformers building blocks
Provides convenient access to the Anthropic REST API from any Python 3
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Official implementation of DreamCraft3D