Wan2.2: Open and Advanced Large-Scale Video Generative Model
From Images to High-Fidelity 3D Assets
Dataset of GPT-2 outputs for research in detection, biases, and more
High-Resolution Image Synthesis with Latent Diffusion Models
An experimental version of DeepSeek model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Qwen2.5-VL is the multimodal large language model series
gpt-oss-120b and gpt-oss-20b are two open-weight language models
State-of-the-art TTS model under 25MB
DeepSeek Coder: Let the Code Write Itself
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Tooling for the Common Objects In 3D dataset
Code for running inference and finetuning with SAM 3 model
Video understanding codebase from FAIR for reproducing video models
FAIR Sequence Modeling Toolkit 2
VMZ: Model Zoo for Video Modeling
Global weather forecasting model using graph neural networks and JAX
Tool for exploring and debugging transformer model behaviors
Repo of Qwen2-Audio chat & pretrained large audio language model
A Production-ready Reinforcement Learning AI Agent Library
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Implementation of the Surya Foundation Model for Heliophysics
High-Resolution Image Synthesis with Latent Diffusion Models
Suite with Real-ESRGAN, BSRGAN , IRCNN, GFPGAN & RIFE. v4.3