LightLLM is a Python-based LLM (Large Language Model) inference
A guidance language for controlling large language models
Multilingual sentence & image embeddings with BERT
How to optimize some algorithms in CUDA
ChatGLM2-6B: An Open Bilingual Chat LLM
A high-performance ML model serving framework that offers dynamic batching
MobileLLM: Optimizing Sub-billion Parameter Language Models
An efficient forwarding service designed for LLMs
A New Axis of Sparsity for Large Language Models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)