Hackable and optimized Transformers building blocks
New family of code large language models (LLMs)
GPT4V-level open-source multi-modal model based on Llama3-8B
Advancing Open-source World Models
Contexts Optical Compression
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Pushing the Limits of Mathematical Reasoning in Open Language Models
Official implementation of DreamCraft3D
Chat & pretrained large vision language model
Audio foundation model excelling in audio understanding
A trainable PyTorch reproduction of AlphaFold 3
Open-source framework for intelligent speech interaction
A state-of-the-art open visual language model
A Production-ready Reinforcement Learning AI Agent Library
State-of-the-art (SoTA) text-to-video pre-trained model
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Designed for text embedding and ranking tasks
An Efficient Agentic Model for Computer Use
High-Fidelity and Controllable Generation of Textured 3D Assets
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
Fast-stable-diffusion + DreamBooth
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Collection of Gemma 3 variants that are trained for performance
High-resolution models for human tasks