CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Generating Immersive, Explorable, and Interactive 3D Worlds
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
DeepSeek Coder: Let the Code Write Itself
High-Resolution Image Synthesis with Latent Diffusion Models
The official repo of Qwen chat & pretrained large language model
Generate Any 3D Scene in Seconds
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Reference PyTorch implementation and models for DINOv3
Multimodal Diffusion with Representation Alignment
Pushing the Limits of Mathematical Reasoning in Open Language Models
Qwen-Image is a powerful image generation foundation model
Global weather forecasting model using graph neural networks and JAX
Large Multimodal Models for Video Understanding and Editing
Qwen3-Coder is the code version of Qwen3
Revolutionizing Database Interactions with Private LLM Technology
Industrial-level controllable zero-shot text-to-speech system
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Diversity-driven optimization and large-model reasoning ability
Capable of understanding text, audio, vision, video