Analyzing Hacker News discussions from a decade ago in hindsight
Fast-stable-diffusion + DreamBooth
A Pragmatic VLA Foundation Model
End-to-end pipeline converting generative videos
OpenTinker is an RL-as-a-Service infrastructure for foundation models
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
Z80-μLM is a 2-bit quantized language model
Simplifies the local serving of AI models from any source
Language Model Reinforcement Learning Environments frameworks
Build a machine learning model from a prompt
LLM training in simple, raw C/CUDA
Less Code, Lower Barrier, Faster Deployment
A simple, secure MCP-to-OpenAPI proxy server
Implementation of "MobileCLIP" CVPR 2024
Code release for Cut and Learn for Unsupervised Object Detection
Official implementation of Watermark Anything with Localized Messages
Training Large Language Model to Reason in a Continuous Latent Space
Video understanding codebase from FAIR for reproducing video models
Towards Real-World Vision-Language Understanding
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible