Reference PyTorch implementation and models for DINOv3
Example Discord bot written in Python that uses the completions API
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FAIR Sequence Modeling Toolkit 2
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Global weather forecasting model using graph neural networks and JAX
A Unified Framework for Text-to-3D and Image-to-3D Generation
Inference framework for 1-bit LLMs
High-resolution models for human tasks
RGBD video generation model conditioned on camera input
Generating Immersive, Explorable, and Interactive 3D Worlds
ChatGPT interface with better UI
Video understanding codebase from FAIR for reproducing video models
Multimodal-Driven Architecture for Customized Video Generation
Code for Mesh R-CNN, ICCV 2019
A Customizable Image-to-Video Model based on HunyuanVideo
Official implementation of DreamCraft3D
Unified Multimodal Understanding and Generation Models
A Powerful Native Multimodal Model for Image Generation
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Official implementation of Watermark Anything with Localized Messages
Personalize Any Characters with a Scalable Diffusion Transformer
4M: Massively Multimodal Masked Modeling
ICLR 2024 Spotlight: curation/training code, metadata, distribution
PyTorch code and models for the DINOv2 self-supervised learning method