Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Large Multimodal Models for Video Understanding and Editing
An Efficient, Scalable, Multi-Modality RL Training Framework
FAIR Sequence Modeling Toolkit 2
GPT4V-level open-source multi-modal model based on Llama3-8B
A comprehensive set of fairness metrics for datasets
Probabilistic programming in Python
Official inference repo for FLUX.2 models
GLM-4 series: Open Multilingual Multimodal Chat LMs
Advanced language and coding AI model
Python inference and LoRA trainer package for the LTX-2 audio–video
Agentic, Reasoning, and Coding (ARC) foundation models
AlphaFold 3 inference pipeline
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Real-Time High-Resolution Background Matting
Towards Human-Level Text-to-Speech through Style Diffusion
Code for the paper Language Models are Unsupervised Multitask Learners
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official PyTorch Implementation
A Unified Framework for Text-to-3D and Image-to-3D Generation
The official Meta Llama 3 GitHub site
PyTorch code and models for VJEPA2 self-supervised learning from video
The repository provides code for running inference with SAM 2
From Images to High-Fidelity 3D Assets