Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
Agentic, Reasoning, and Coding (ARC) foundation models
Qwen3 is the large language model series developed by Qwen team
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Powerful AI language model (MoE) optimized for efficiency/performance
Framework to easily create LLM powered bots over any dataset
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LLM
Open-source, high-performance AI model with advanced reasoning
Interact with your documents using the power of GPT
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Operating LLMs in production
An elegent pytorch implement of transformers
A guidance language for controlling large language models
Access large language models from the command-line
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Inference code for CodeLlama models
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
ChatGLM2-6B: An Open Bilingual Chat LLM
An LLM-powered knowledge curation system that researches topics