Python bindings for llama.cpp
Structured outputs for llms
Agentic, Reasoning, and Coding (ARC) foundation models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Qwen3 is the large language model series developed by Qwen team
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Qwen3-Coder is the code version of Qwen3
A high-throughput and memory-efficient inference and serving engine
The Multi-Agent Framework
Low-code app builder for RAG and multi-agent AI applications
lightweight package to simplify LLM API calls
⚡ Building applications with LLMs through composability ⚡
Central interface to connect your LLM's with external data
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Qwen-Image is a powerful image generation foundation model
Operating LLMs in production
OpenDAN is an open source Personal AI OS
Interact with your documents using the power of GPT
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Framework that is dedicated to making neural data processing
Integrate cutting-edge LLM technology quickly and easily into your app