Structured outputs for LLMs
Python bindings for llama.cpp
Run Local LLMs on Any Device. Open-source
An elegant PyTorch implementation of transformers
Port of Facebook's LLaMA model in C/C++
Low-code app builder for RAG and multi-agent AI applications
Interact with your documents using the power of GPT
Agentic, Reasoning, and Coding (ARC) foundation models
A high-throughput and memory-efficient inference and serving engine
Advanced language and coding AI model
Operating LLMs in production
Multilingual sentence & image embeddings with BERT
Open-source, high-performance AI model with advanced reasoning
An LLM-powered knowledge curation system that researches topics
Building applications with LLMs through composability
Lightweight package to simplify LLM API calls
A modular graph-based Retrieval-Augmented Generation (RAG) system
ChatGLM3 series: Open Bilingual Chat LLMs
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Fully automatic censorship removal for language models
Open source libraries and APIs to build custom preprocessing pipelines
Powerful AI language model (MoE) optimized for efficiency/performance
Open-Source Financial Large Language Models
ChatGLM2-6B: An Open Bilingual Chat LLM
A guidance language for controlling large language models