Multimodal-Driven Architecture for Customized Video Generation
A collection of reference Jupyter notebooks and demo AI/ML applications
A Powerful Native Multimodal Model for Image Generation
Advanced evolutionary computation library built on top of PyTorch
A Customizable Image-to-Video Model based on HunyuanVideo
SAPIEN Manipulation Skill Framework
Official implementation of DreamCraft3D
Meta Agents Research Environments is a comprehensive platform
MemU is an open-source memory framework for AI companions
Unified Multimodal Understanding and Generation Models
The best ChatGPT that $100 can buy
PyTorch code and models for VJEPA2 self-supervised learning from video
4M: Massively Multimodal Masked Modeling
ICLR 2024 Spotlight: curation/training code, metadata, distribution
[CVPR 2025 Best Paper Award] VGGT
PyTorch code and models for the DINOv2 self-supervised learning
Educational framework exploring multi-agent orchestration
FAIR's research platform for object detection research
RL implementations
Optimized Workforce Learning for General Multi-Agent Assistance
Open-Source Framework for Distributed Constraint Optimization (DCOP)
A computer vision framework to create and deploy apps in minutes
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Repo for external large-scale work