Structured outputs for llms
Python bindings for llama.cpp
Port of Facebook's LLaMA model in C/C++
Toolkit for conversational AI
The Multi-Agent Framework
A high-throughput and memory-efficient inference and serving engine
Run Local LLMs on Any Device. Open-source
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Ongoing research training transformer models at scale
Low-code app builder for RAG and multi-agent AI applications
Visual Instruction Tuning: Large Language-and-Vision Assistant
lightweight package to simplify LLM API calls
Revolutionizing Database Interactions with Private LLM Technology
Agentic, Reasoning, and Coding (ARC) foundation models
Phi-3.5 for Mac: Locally-run Vision and Language Models
An LLM-powered knowledge curation system that researches topics
Simple, Pythonic building blocks to evaluate LLM applications
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
A modular graph-based Retrieval-Augmented Generation (RAG) system
The unofficial python package that returns response of Google Bard
An elegent pytorch implement of transformers
Interact with your documents using the power of GPT
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Access large language models from the command-line