An Open Source text-to-speech system built by inverting Whisper
Offline inference engine for art, real-time voice conversations
OCR expert VLM powered by Hunyuan's native multimodal architecture
RGBD video generation model conditioned on camera input
TokenSpeed is a speed-of-light LLM inference engine
Build a modern LLM from scratch. Every line commented
Curated list of classic, high-quality computer science books
Classic papers and resources on recommendation
Focus on creating classic Python small examples and cases
Bridging LLM and Recommender System
Semi-Structured Agentic Framework. Workflows build themselves
Minimal reproduction of OneRec
A powerful tool for automated LLM fuzzing
Mastering Applied AI, One Concept at a Time
NeurIPS2025 Spotlight] Quantized Attention
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
A frontier, first-principles handbook
Habit Tracker for the AI Coding Workshop
Z80-μLM is a 2-bit quantized language model
PPTAgent: Generating and Evaluating Presentations
Implementation of "MobileCLIP" CVPR 2024
ChatGLM2-6B: An Open Bilingual Chat LLM
Tool for exploring and debugging transformer model behaviors
CLIP, Predict the most relevant text snippet given an image