Python bindings for llama.cpp
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
Low-code app builder for RAG and multi-agent AI applications
A high-throughput and memory-efficient inference and serving engine
Data Lake for Deep Learning. Build, manage, and query datasets
A guidance language for controlling large language models
Application that simplifies the installation of AI-related projects
Revolutionizing Database Interactions with Private LLM Technology
LLM abstractions that aren't obstructions
Open-source end-to-end LLM Development Platform
Integrate cutting-edge LLM technology quickly and easily into your app
A modular graph-based Retrieval-Augmented Generation (RAG) system
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Open-source observability for your LLM application
BISHENG is an open LLM devops platform for next generation apps
Replace OpenAI GPT with another LLM in your app
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Database system for building simpler and faster AI-powered application
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Open source libraries and APIs to build custom preprocessing pipelines
Central interface to connect your LLM's with external data
A high-performance ML model serving framework, offers dynamic batching
Swirl queries any number of data sources with APIs