A @ClickHouse fork that supports high-performance vector search
An orchestration framework for agentic AI and LLM applications
Unified interface for AI chat, Agentic workflows and more
Dance with Intelligence in Your Code
Evaluate your LLM's response with Prometheus and GPT4
Extension of Google Research’s PaperBanana
Alibaba's high-performance LLM inference engine for diverse apps
Fetch source code for npm packages
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
A minimal LLM chat app that runs entirely in your browser
A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Uncertainty Quantification for Language Models, is a Python package
Streamlines and simplifies prompt design for both developers
Production-ready AI chat. Start here and make it your own
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Join a time-traveling adventure where you meet history’s legends
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
Run a 1-billion parameter LLM on a $10 board with 256MB RAM
MemoryOS is designed to provide a memory operating system
UCCL is an efficient communication library for GPUs
Cloud-native runtime for agentic AI