Ultimate meta-skill for generating best-in-class Claude Code skills
End-to-end pipeline converting generative videos
A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Collection of Gemma 3 variants that are trained for performance
Collection of reference environments, offline reinforcement learning
Simple and easily configurable grid world environments
LLM training in simple, raw C/CUDA
Fast and accurate AI powered file content types detection
Code release for Cut and Learn for Unsupervised Object Detection
Official implementation of Watermark Anything with Localized Messages
Training Large Language Model to Reason in a Continuous Latent Space
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Tool for exploring and debugging transformer model behaviors
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
A Unified Framework for Text-to-3D and Image-to-3D Generation
Personalize Any Characters with a Scalable Diffusion Transformer
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Extensible AGI Framework
Collaborative & Open-Source Quality Assurance for all AI models