A lightweight vLLM implementation built from scratch
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Low-latency REST API for serving text embeddings
OpenCompass is an LLM evaluation platform
GPT4V-level open-source multi-modal model based on Llama3-8B
This repository provides an advanced RAG
Chinese and English multimodal conversational language model
An efficient forwarding service designed for LLMs
Chinese Llama-3 LLMs developed from Meta Llama 3
Open-source, high-performance Mixture-of-Experts large language model
Chat & pretrained large vision language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
An unofficial Python package that returns responses from Google Bard
Ray Aviary - evaluate multiple LLMs easily
Serving multiple LoRA-finetuned LLMs as one
Serving LangChain LLM apps automagically with FastAPI
Database system for building simpler and faster AI-powered applications
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Training and serving large-scale neural networks
An interpretable and efficient predictor using pre-trained models
8.5K high-quality grade school math problems