Harmonized and Coherent Human Image Animation
An autonomous AI researcher
An end-to-end Data Scientist
Free, local, self-hosted video compressor web UI
Quick illustration of how one can easily read books together with LLMs
A frontier, first-principles handbook
Foundation model for image generation
Analyzing Hacker News discussions from a decade ago in hindsight
Official code for StoryMem: Multi-shot Long Video Storytelling
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Motion-controllable Video Generation via Latent Trajectory Guidance
Streaming Real-time Audio-Driven Avatar Generation
Let agents classify your bank transactions
Habit Tracker for the AI Coding Workshop
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
General plug-and-play inference library for Recursive Language Models
Anthropic's original performance take-home, now open for you to try
Socket.IO integration for Flask applications
The async Python driver for MongoDB and Tornado or asyncio
"Big Model" trains a visual multimodal VLM with 26M parameters