Wan2.2: Open and Advanced Large-Scale Video Generative Model
From Images to High-Fidelity 3D Assets
An experimental version of DeepSeek model
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Dataset of GPT-2 outputs for research in detection, biases, and more
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Qwen2.5-VL is the multimodal large language model series
gpt-oss-120b and gpt-oss-20b are two open-weight language models
State-of-the-art TTS model under 25MB
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
FAIR Sequence Modeling Toolkit 2
Video understanding codebase from FAIR for reproducing video models
DeepSeek Coder: Let the Code Write Itself
Tooling for the Common Objects In 3D dataset
Global weather forecasting model using graph neural networks and JAX
Tool for exploring and debugging transformer model behaviors
VMZ: Model Zoo for Video Modeling
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Repo of Qwen2-Audio chat & pretrained large audio language model
A Production-ready Reinforcement Learning AI Agent Library
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Implementation of the Surya Foundation Model for Heliophysics
High-Resolution Image Synthesis with Latent Diffusion Models
Chinese LLaMA & Alpaca large language model + local CPU/GPU training