BISHENG is an open LLM devops platform for next generation apps
Self-hosted, community-driven, local OpenAI compatible API
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Text generator is a handy plugin for Obsidian
One API for plugins and datasets, one interface for prompt engineering
Simple, Pythonic building blocks to evaluate LLM applications
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open-source, high-performance AI model with advanced reasoning
Private Open AI on Kubernetes
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
An LLM-powered knowledge curation system that researches topics
Interact with your documents using the power of GPT
Integrate cutting-edge LLM technology quickly and easily into your app
Framework and no-code GUI for fine-tuning LLMs
Operating LLMs in production
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Open-weight, large-scale hybrid-attention reasoning model
Build AI-powered applications with React, Svelte, Vue, and Solid
LLM Frontend for Power Users
The production toolkit for LLMs. Observability, prompt management
A modular graph-based Retrieval-Augmented Generation (RAG) system
ChatGPT WebUI
Distribute and run LLMs with a single file
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph