Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency and performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Advanced language and coding AI model
A high-throughput and memory-efficient inference and serving engine
Open-Source Financial Large Language Models
Interact with your documents using the power of GPT
Lightweight package to simplify LLM API calls
⚡ Building applications with LLMs through composability ⚡
Qwen-Image is a powerful image generation foundation model
The official repo of Qwen chat & pretrained large language model
An AI personal assistant for your digital brain
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Python bindings for llama.cpp
Open-source observability for your LLM application
OpenCompass is an LLM evaluation platform
Inference Llama 2 in one file of pure C
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A high-performance ML model serving framework that offers dynamic batching
Tensor search for humans
Leveraging BERT and c-TF-IDF to create easily interpretable topics
Inference code for CodeLlama models
The Multi-Agent Framework