A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Streamlines and simplifies prompt design for both developers
A.S.E (AICGSecEval) is a repository-level AI-generated code security
AI-Driven Exploration in the Space of Code
Hypernetworks that adapt LLMs for specific benchmark tasks
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
E2B Desktop Sandbox for LLMs. E2B Sandbox
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Your Personal Research Multi-Tool
Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems
Data Infrastructure providing an approach to multimodal AI workloads
An Efficient Web-enhanced Question Answering System
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs