Llama Chinese community, real-time aggregation
Large Language Model Principles and Practice Tutorial from Scratch
Run LLM prompts from your shell
Analyzing Hacker News discussions from a decade ago in hindsight
A New Axis of Sparsity for Large Language Models
LLM training in simple, raw C/CUDA
Training Large Language Models to Reason in a Continuous Latent Space
Scalable data preprocessing and curation toolkit for LLMs
Implement a CPU from scratch and play with large model deployments
The official repository for ERNIE 4.5 and ERNIEKit
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A large language model for medical consultation in Chinese
LongBench v2 and LongBench (ACL '25 & '24)
On the Structural Pruning of Large Language Models
SQL-Driven RAG Engine
Streamlines and simplifies prompt design for both developers
MemoryOS is designed to provide a memory operating system
Towards Efficient Self-Evolving Agent System
Driving with Graph Visual Question Answering
Chat with any codebase in under two minutes | Fully local
E2M converts various file types (doc, docx, epub, html, htm, url
Unified KV Cache Compression Methods for Auto-Regressive Models
Learning to Reason with Search for LLMs via Reinforcement Learning
Traditional Mandarin LLMs for Taiwan
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG