Python bindings for llama.cpp
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Port of Facebook's LLaMA model in C/C++
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Powerful AI language model (MoE) optimized for efficiency/performance
Tiny vision language model
New family of code large language models (LLMs)
Qwen3-Coder is the code version of Qwen3
Phi-3.5 for Mac: Locally-run Vision and Language Models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open Source Speech Language Model
Qwen3 is the large language model series developed by Qwen team
Designed for text embedding and ranking tasks
Advanced language and coding AI model
Open-source, high-performance AI model with advanced reasoning
DeepSeek Coder: Let the Code Write Itself
Open-source large language model family from Tencent Hunyuan
Open-Source Financial Large Language Models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Official inference repo for FLUX.1 models
Revolutionizing Database Interactions with Private LLM Technology
Z80-μLM is a 2-bit quantized language model
Agentic, Reasoning, and Coding (ARC) foundation models