Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Uncertainty Quantification for Language Models, is a Python package
Streamlines and simplifies prompt design for both developers
A.S.E (AICGSecEval) is a repository-level AI-generated code security
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
MemoryOS is designed to provide a memory operating system
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
E2B Desktop Sandbox for LLMs. E2B Sandbox
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems