Structured outputs for llms
Python bindings for llama.cpp
Toolkit for conversational AI
The Multi-Agent Framework
Run Local LLMs on Any Device. Open-source
A high-throughput and memory-efficient inference and serving engine
Ongoing research training transformer models at scale
lightweight package to simplify LLM API calls
Visual Instruction Tuning: Large Language-and-Vision Assistant
Revolutionizing Database Interactions with Private LLM Technology
Agentic, Reasoning, and Coding (ARC) foundation models
Phi-3.5 for Mac: Locally-run Vision and Language Models
Simple, Pythonic building blocks to evaluate LLM applications
An LLM-powered knowledge curation system that researches topics
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A modular graph-based Retrieval-Augmented Generation (RAG) system
The unofficial python package that returns response of Google Bard
An elegent pytorch implement of transformers
Interact with your documents using the power of GPT
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
State-of-the-art Parameter-Efficient Fine-Tuning
Access large language models from the command-line
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
Qwen3 is the large language model series developed by Qwen team
BISHENG is an open LLM devops platform for next generation apps