Python bindings for llama.cpp
Structured outputs for llms
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
Powerful AI language model (MoE) optimized for efficiency/performance
Qwen3-Coder is the code version of Qwen3
Port of Facebook's LLaMA model in C/C++
Open-source, high-performance AI model with advanced reasoning
Qwen3 is the large language model series developed by Qwen team
Low-code app builder for RAG and multi-agent AI applications
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Qwen-Image is a powerful image generation foundation model
A high-throughput and memory-efficient inference and serving engine
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Inference code for CodeLlama models
Integrate cutting-edge LLM technology quickly and easily into your app
PandasAI is a Python library that integrates generative AI
Code for the paper "Evaluating Large Language Models Trained on Code"
Framework and no-code GUI for fine-tuning LLMs
⚡ Building applications with LLMs through composability ⚡
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Qwen3-omni is a natively end-to-end, omni-modal LLM
Operating LLMs in production