Low-latency REST API for serving text embeddings
Technical principles related to large models
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Repo of Qwen2-Audio chat & pretrained large audio language model
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Data Lake for Deep Learning. Build, manage, and query datasets
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Set of tools to assess and improve LLM security
GLM-4-Voice | End-to-End Chinese-English Conversational Model
OpenCompass is an LLM evaluation platform
Ray Aviary - evaluate multiple LLMs easily
Building applications with LLMs through composability
Qwen2.5-VL is the multimodal large language model series
State-of-the-art Parameter-Efficient Fine-Tuning
Serving LangChain LLM apps automagically with FastAPI
Llama 2 inference in one file of pure C
Math-specific large language models from our Qwen2 series
An AI personal assistant for your digital brain
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
LLM training in simple, raw C/CUDA
Ling is a MoE LLM provided and open-sourced by InclusionAI
AI agent that streamlines the entire process of data analysis
Gorilla: An API store for LLMs
Learn AI and LLMs from scratch using free resources