CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A high-throughput and memory-efficient inference and serving engine
State-of-the-art Parameter-Efficient Fine-Tuning
Multilingual sentence & image embeddings with BERT
MobileLLM Optimizing Sub-billion Parameter Language Models
Low-code framework for building custom LLMs, neural networks
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Operating LLMs in production
Gemma open-weight LLM library, from Google DeepMind
Replace OpenAI GPT with another LLM in your app
Framework and no-code GUI for fine-tuning LLMs
Qwen3-Coder is the code version of Qwen3
Designed for text embedding and ranking tasks
A series of math-specific large language models of our Qwen2 series
Utilities intended for use with Llama models
Unified KV Cache Compression Methods for Auto-Regressive Models
Toolkit for conversational AI
Qwen3-omni is a natively end-to-end, omni-modal LLM
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Capable of understanding text, audio, vision, video
PyTorch library of curated Transformer models and their components
A state-of-the-art open visual language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Code for Language models can explain neurons in language models paper