Running large language models on a single GPU
Minimal and clean examples of machine learning algorithms
Advanced AI Explainability for computer vision
Transfer learning / domain adaptation / domain generalization
Designed for training LLM/VLM agents via RL
Agent framework that enables tool-use agent tasks
A large-scale model of medical consultation in Chinese
LongBench v2 and LongBench (ACL 25'&24')
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Streamlines and simplifies prompt design for both developers
Linkedin Automation Tool
MemoryOS is designed to provide a memory operating system
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Recipes to train reward model for RLHF
A tension reasoning engine over 131 S-class problems
An Efficient Web-enhanced Question Answering System
An LLM Compiler for Parallel Function Calling