Python bindings for llama.cpp
Structured outputs for llms
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Low-code app builder for RAG and multi-agent AI applications
A high-throughput and memory-efficient inference and serving engine
A modular graph-based Retrieval-Augmented Generation (RAG) system
Open-source end-to-end LLM Development Platform
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Open-source observability for your LLM application
Central interface to connect your LLM's with external data
Multilingual sentence & image embeddings with BERT
Framework and no-code GUI for fine-tuning LLMs
LLM based data scientist, AI native data application
State-of-the-art Parameter-Efficient Fine-Tuning
Interact with your documents using the power of GPT
Revolutionizing Database Interactions with Private LLM Technology
Ongoing research training transformer models at scale
LLM training code for MosaicML foundation models
An AI personal assistant for your digital brain
⚡ Building applications with LLMs through composability ⚡
Integrate cutting-edge LLM technology quickly and easily into your app