Open-source observability for your LLM application
Operating LLMs in production
Inference Llama 2 in one file of pure C
An elegent pytorch implement of transformers
LLM training in simple, raw C/CUDA
Low-code framework for building custom LLMs, neural networks
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
This repository provides an advanced RAG
Qwen3-Coder is the code version of Qwen3
Multilingual sentence & image embeddings with BERT
Adding guardrails to large language models
LLM based data scientist, AI native data application
Tools like web browser, computer access and code runner for LLMs
Advanced techniques for RAG systems
A high-performance ML model serving framework, offers dynamic batching
Framework that is dedicated to making neural data processing
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
An LLM-powered knowledge curation system that researches topics
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
Integrating LLMs into structured NLP pipelines
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Set of tools to assess and improve LLM security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Train a 26M-parameter GPT from scratch in just 2h
Research-oriented chatbot framework