A simple but powerful self-hosted finance tracker
Analyzing Hacker News discussions from a decade ago in hindsight
Official code for StoryMem: Multi-shot Long Video Storytelling
Making RAG Simpler with Small and Open-Sourced Language Models
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos
Streaming Real-time Audio-Driven Avatar Generation
Let agents classify your bank transactions
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
Claude Code skill that researches any topic across Reddit + X
"Big Model" trains a visual multimodal VLM with 26M parameters
A pretty sweet vulnerability scanner
Language Model Reinforcement Learning Environments frameworks
An anomaly detection library comprising state-of-the-art algorithms
Collection of reference environments, offline reinforcement learning
Simple and easily configurable grid world environments
Curl cryptocurrencies exchange rates
Google CTF
A simple, secure MCP-to-OpenAPI proxy server
Implementation of "MobileCLIP" CVPR 2024