Low-latency REST API for serving text-embeddings
Technical principles related to large models
Repo of Qwen2-Audio chat & pretrained large audio language model
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Data Lake for Deep Learning. Build, manage, and query datasets
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Set of tools to assess and improve LLM security
GLM-4-Voice | End-to-End Chinese-English Conversational Model
OpenCompass is an LLM evaluation platform
Ray Aviary - evaluate multiple LLMs easily
Building applications with LLMs through composability
Qwen2.5-VL is the multimodal large language model series
State-of-the-art Parameter-Efficient Fine-Tuning
Inference Llama 2 in one file of pure C
A series of math-specific large language models of our Qwen2 series
An AI personal assistant for your digital brain
Serving LangChain LLM apps automagically with FastApi
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Learn AI and LLMs from scratch using free resources
LLM training in simple, raw C/CUDA
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
A state-of-the-art open visual language model
Ling is a MoE LLM provided and open-sourced by InclusionAI