GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Repo for external large-scale work
Official PyTorch Implementation of "Scalable Diffusion Models"
Implementation of model parallel autoregressive transformers on GPUs
Open-Source Financial Large Language Models!
Open-source, high-performance Mixture-of-Experts large language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Janus-Series: Unified Multimodal Understanding and Generation Models
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Open Multilingual Multimodal Chat LMs
ICLR2024 Spotlight: curation/training code, metadata, distribution
JetBrains’ 4B parameter code model for completions
Vision-language-action model for robot control via images and text
Tencent’s 36-language state-of-the-art translation model
OpenAI’s compact 20B open model for fast, agentic, and local use