Test-Time Reinforcement Learning
A simple yet powerful agent framework that delivers with models
Bridging LLM and Recommender System
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Semi-Structured Agentic Framework. Workflows build themselves
A Gym environment for web task automation
Parallax is a distributed model serving framework
Minimal reproduction of OneRec
Redundancy-aware KV Cache Compression for Reasoning Models
A powerful tool for automated LLM fuzzing
AI-powered tool for efficient abstract and PDF screening
The official implementation of RAPTOR
AI-driven multi-agent research assistant automating hypothesis
Synthetic data curation for post-training and data extraction
A high-quality PDF to Markdown tool based on large language model
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
Specify a github or local repo, github pull request
Easy token price estimates for 400+ LLMs. TokenOps
Deploy your agentic worfklows to production
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
the terminal client for Ollama
Modular AI runtime for robots
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention