Wan2.2: Open and Advanced Large-Scale Video Generative Model
From Images to High-Fidelity 3D Assets
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Dataset of GPT-2 outputs for research in detection, biases, and more
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Qwen2.5-VL is the multimodal large language model series
gpt-oss-120b and gpt-oss-20b are two open-weight language models
DeepSeek Coder: Let the Code Write Itself
State-of-the-art TTS model under 25MB
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Tooling for the Common Objects In 3D dataset
Video understanding codebase from FAIR for reproducing video models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
FAIR Sequence Modeling Toolkit 2
VMZ: Model Zoo for Video Modeling
Global weather forecasting model using graph neural networks and JAX
Tool for exploring and debugging transformer model behaviors
Code for running inference and finetuning with SAM 3 model
Repo of Qwen2-Audio chat & pretrained large audio language model
A Production-ready Reinforcement Learning AI Agent Library
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Implementation of the Surya Foundation Model for Heliophysics
High-Resolution Image Synthesis with Latent Diffusion Models