Wan2.2: Open and Advanced Large-Scale Video Generative Model
From Images to High-Fidelity 3D Assets
Dataset of GPT-2 outputs for research in detection, biases, and more
An experimental version of the DeepSeek model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Qwen2.5-VL: a multimodal large language model series
gpt-oss-120b and gpt-oss-20b are two open-weight language models
State-of-the-art TTS model under 25MB
DeepSeek Coder: Let the Code Write Itself
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code for running inference and finetuning with SAM 3 model
Video understanding codebase from FAIR for reproducing video models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
FAIR Sequence Modeling Toolkit 2
Tool for exploring and debugging transformer model behaviors
Repo of Qwen2-Audio chat & pretrained large audio language model
Implementation of the Surya Foundation Model for Heliophysics
High-Resolution Image Synthesis with Latent Diffusion Models
Suite with Real-ESRGAN, BSRGAN, IRCNN, GFPGAN & RIFE. v4.3
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
800,000 step-level correctness labels on LLM solutions to MATH problems
An implementation of model parallel GPT-2 and GPT-3-style models
Large-scale autoregressive pixel model for image generation by OpenAI
Vision-language-action model for robot control via images and text