Fully automatic censorship removal for language models
AI-Driven Exploration in the Space of Code
Compress tool outputs, logs, files, and RAG chunks
Recipes to train reward model for RLHF
Text-space optimizer that trains reusable natural-language skills
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Big Model Application Development Practice 1
A course of learning LLM inference serving on Apple Silicon
The official repository for ERNIE 4.5 and ERNIEKit
A lightweight vLLM implementation built from scratch
Minimal reproduction of OneRec
How to optimize some algorithm in cuda
A frontier, first-principles handbook
LightLLM is a Python-based LLM (Large Language Model) inference
CV, NLP, LLM project applications, and advanced engineering deployment
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
NeurIPS2025 Spotlight] Quantized Attention
Diversity-driven optimization and large-model reasoning ability
Technical principles related to large models
Control Gmail, Google Calendar, Docs, Sheets, Slides, Chat, Forms
⚡ Building applications with LLMs through composability ⚡
Accelerate local LLM inference and finetuning
Semi-Structured Agentic Framework. Workflows build themselves