Python bindings for llama.cpp
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source, high-performance AI model with advanced reasoning
Port of Facebook's LLaMA model in C/C++
Revolutionizing Database Interactions with Private LLM Technology
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Phi-3.5 for Mac: Locally-run Vision and Language Models
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Inference framework for 1-bit LLMs
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of model parallel autoregressive transformers on GPUs
LLaMA: Open and Efficient Foundation Language Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code version of Qwen3, the large language model by Alibaba Cloud
Powerful large language model (LLM) from Alibaba Cloud
Open-Source Financial Large Language Models!
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-source, high-performance Mixture-of-Experts large language model
Qwen (通义千问) chat/pretrained large language model from Alibaba Cloud