Focus on creating classic Python small examples and cases
Data science interview questions and answers
Solve puzzles. Learn CUDA
Build multimodal AI applications with cloud-native stack
An efficient forwarding service designed for LLMs
Test-Time Reinforcement Learning
Bridging LLM and Recommender System
Semi-Structured Agentic Framework. Workflows build themselves
Minimal reproduction of OneRec
A powerful tool for automated LLM fuzzing
Visual intelligence for your home.
The official implementation of RAPTOR
From nobody to big model (LLM) hero
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Open-source model for program synthesis
Llama Chinese community, real-time aggregation
RAG Search API
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG
Building an Intelligent Agent from Scratch