Structured outputs for llms
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
An elegent pytorch implement of transformers
Qwen3 is the large language model series developed by Qwen team
Interact with your documents using the power of GPT
Multilingual sentence & image embeddings with BERT
Fully automatic censorship removal for language models
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Operating LLMs in production
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Open-source, high-performance AI model with advanced reasoning
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Adding guardrails to large language models
An LLM-powered knowledge curation system that researches topics
A modular graph-based Retrieval-Augmented Generation (RAG) system
Access large language models from the command-line
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A list of free LLM inference resources accessible via API
lightweight package to simplify LLM API calls
Powerful AI language model (MoE) optimized for efficiency/performance