A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Uncertainty Quantification for Language Models, is a Python package
Streamlines and simplifies prompt design for both developers
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A.S.E (AICGSecEval) is a repository-level AI-generated code security
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
Generate probable usernames from LinkedIn company employee lists
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward model for RLHF
Constrained Value Alignment via Safe Reinforcement Learning
An Efficient Web-enhanced Question Answering System
Bringing BERT into modernity via both architecture changes and scaling