Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
Agentic, Reasoning, and Coding (ARC) foundation models
Qwen3 is the large language model series developed by Qwen team
LLM
Powerful AI language model (MoE) optimized for efficiency/performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
lightweight package to simplify LLM API calls
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Open-source, high-performance AI model with advanced reasoning
An elegent pytorch implement of transformers
Interact with your documents using the power of GPT
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Operating LLMs in production
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
ChatGLM2-6B: An Open Bilingual Chat LLM
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
PandasAI is a Python library that integrates generative AI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Qwen3-Coder is the code version of Qwen3