Running large language models on a single GPU
Minimal and clean examples of machine learning algorithms
Transfer learning / domain adaptation / domain generalization
Learn how to develop, deploy and iterate on production-grade ML
AI Agent Evaluator & Red Team Platform
Schema-Guided Reasoning (SGR) has agentic system design
Designed for training LLM/VLM agents via RL
Agent framework that enables tool-use agent tasks
Extension of Google Research’s PaperBanana
Chat with your documents using local AI
A large-scale model of medical consultation in Chinese
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Streamlines and simplifies prompt design for both developers
A.S.E (AICGSecEval) is a repository-level AI-generated code security
AI-Driven Exploration in the Space of Code
Towards Efficient Self-Evolving Agent System
Chat with any codebase in under two minutes | Fully local
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Take control of your AI agents
Traditional Mandarin LLMs for Taiwan
Constrained Value Alignment via Safe Reinforcement Learning
Bringing BERT into modernity via both architecture changes and scaling
Scalable RL solution for advanced reasoning of language models