Open-source observability for your LLM application
Inference Llama 2 in one file of pure C
This repository provides an advanced RAG
Qwen3-Coder is the code version of Qwen3
LLM training in simple, raw C/CUDA
Low-code framework for building custom LLMs, neural networks
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Operating LLMs in production
An elegent pytorch implement of transformers
Multilingual sentence & image embeddings with BERT
Adding guardrails to large language models
Advanced techniques for RAG systems
LLM based data scientist, AI native data application
Tools like web browser, computer access and code runner for LLMs
Integrating LLMs into structured NLP pipelines
A high-performance ML model serving framework, offers dynamic batching
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Framework that is dedicated to making neural data processing
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
An LLM-powered knowledge curation system that researches topics
Large-scale Self-supervised Pre-training Across Tasks, Languages, etc.
Set of tools to assess and improve LLM security
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Train a 26M-parameter GPT from scratch in just 2h
Research-oriented chatbot framework