An end-to-end Data Scientist
Quick illustration of how one can easily read books together with LLMs
Spark-TTS Inference Code
A frontier, first-principles handbook
Foundation model for image generation
Analyzing Hacker News discussions from a decade ago in hindsight
Making RAG Simpler with Small and Open-Sourced Language Models
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Fast-stable-diffusion + DreamBooth
Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Motion-controllable Video Generation via Latent Trajectory Guidance
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
Minimal Claude Code alternative. Single Python file, zero dependencies
A New Axis of Sparsity for Large Language Models
Anthropic's original performance take-home, now open for you to try
The knowledge and task management backbone for AI coding assistants
"Big Model" trains a vision-language model (VLM) with 26M parameters
Simplifies the local serving of AI models from any source
Collection of Gemma 3 variants that are trained for performance