Agent framework that enables tool-use agent tasks
Extension of Google Research’s PaperBanana
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
A large language model for medical consultation in Chinese
LongBench v2 and LongBench (ACL '25 & '24)
SQL-Driven RAG Engine
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Hypernetworks that adapt LLMs for specific benchmark tasks
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan
Daily updated lists of cloud, bot, and service IP ranges
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward models for RLHF
A tension reasoning engine over 131 S-class problems
Constrained Value Alignment via Safe Reinforcement Learning
An Efficient Web-enhanced Question Answering System
Bringing BERT into modernity via both architecture changes and scaling
Scalable RL solution for advanced reasoning of language models