Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
An elegent pytorch implement of transformers
Port of Facebook's LLaMA model in C/C++
Qwen3 is the large language model series developed by Qwen team
Low-code app builder for RAG and multi-agent AI applications
Interact with your documents using the power of GPT
Multilingual sentence & image embeddings with BERT
Fully automatic censorship removal for language models
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Operating LLMs in production
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Open-source, high-performance AI model with advanced reasoning
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Adding guardrails to large language models
An LLM-powered knowledge curation system that researches topics
Building applications with LLMs through composability
A modular graph-based Retrieval-Augmented Generation (RAG) system
Access large language models from the command-line
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)