Bridging LLM and Recommender System
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding
Minimal reproduction of OneRec
Redundancy-aware KV Cache Compression for Reasoning Models
Visual intelligence for your home.
The official implementation of RAPTOR
Specify a github or local repo, github pull request
From nobody to big model (LLM) hero
Mastering Applied AI, One Concept at a Time
NeurIPS2025 Spotlight] Quantized Attention
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Llama Chinese community, real-time aggregation
Large Language Model Principles and Practice Tutorial from Scratch
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm
Ongoing research training transformer models at scale
Long-form streaming TTS system for multi-speaker dialogue generation
AI memory OS for LLM and Agent systems
All-in-one AI framework & toolkit for Claude Code & Cursor
Document Index for Vectorless, Reasoning-based RAG
Harmonized and Coherent Human Image Animation
Latent Collaboration in Multi-Agent Systems
Run LLM prompts from your shell
A frontier, first-principles handbook