Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
Minimal Claude Code alternative. Single Python file, zero dependencies
SimpleMem: Efficient Lifelong Memory for LLM Agents
A New Axis of Sparsity for Large Language Models
Z80-μLM is a 2-bit quantized language model
Anthropic's original performance take-home, now open for you to try
Socket.IO integration for Flask applications
The async Python driver for MongoDB and Tornado or asyncio
"Big Model" trains a visual multimodal VLM with 26M parameters
A pretty sweet vulnerability scanner
Simplifies the local serving of AI models from any source
Collection of Gemma 3 variants that are trained for performance
Detect and validate 500+ types of hardcoded secrets
Language Model Reinforcement Learning Environments frameworks
An anomaly detection library comprising state-of-the-art algorithms
Improve human sleep through scientifically
Collection of reference environments, offline reinforcement learning
Simple and easily configurable grid world environments
Curl cryptocurrencies exchange rates
Enables the best performance on NVIDIA RTX Graphics Cards
Spanish-language course repository that teaches fundamentals of SQL
A minimal, modern Python project template
Rename anything