Python bindings for llama.cpp
Structured outputs for LLMs
Agentic, Reasoning, and Coding (ARC) foundation models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Run Local LLMs on Any Device. Open-source
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3 is the large language model series developed by the Qwen team
Open-source, high-performance AI model with advanced reasoning
Qwen3-Coder is the code version of Qwen3
A high-throughput and memory-efficient inference and serving engine
⚡ Building applications with LLMs through composability ⚡
The Multi-Agent Framework
Central interface to connect your LLMs with external data
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official repo of Qwen chat & pretrained large language model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Capable of understanding text, audio, vision, video
Qwen-Image is a powerful image generation foundation model
Framework and no-code GUI for fine-tuning LLMs
Inference code for CodeLlama models
Framework that is dedicated to making neural data processing
Repo of Qwen2-Audio chat & pretrained large audio language model
Low-code framework for building custom LLMs, neural networks
A modular graph-based Retrieval-Augmented Generation (RAG) system