Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Qwen2.5-VL is the multimodal large language model series
Conversational voice AI agents
An open-source RAG-based tool for chatting with your documents
Spring AI Alibaba examples for building and testing AI apps
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
The repository provides code for running inference with SAM 2
Build voice-based LLM agents. Modular + open source
Easy-to-use and high-performance NLP and LLM framework
Agentic LLM Vulnerability Scanner / AI red teaming kit
Deep learning optimization library: makes distributed training easy
The highest-scoring AI memory system ever benchmarked
An MCP server for interacting with Google Colab
Secure local-first microVM sandbox for running untrusted code fast
HivisionIDPhotos: a lightweight and efficient AI ID photos tools
Enhances Tesseract OCR output using LLMs (local or API)
Performance-optimized AI inference on your GPUs
Collection of awesome LLM apps with AI Agents and RAG using OpenAI
Compress tool outputs, logs, files, and RAG chunks
A Pythonic framework to simplify AI service building
Persistent AI memory using local Markdown knowledge graphs
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A SOTA open-source image editing model
FlashInfer: Kernel Library for LLM Serving
Implementation of TurboQuant (ICLR 2026)