A tool to use the Ai2 Open Coding Agents Soft-Verified Agents
Hunyuan Translation Model Version 1.5
Persistent context and multi-instance coordination
Block Diffusion for Ultra-Fast Speculative Decoding
Multimodal embedding and reranking models built on Qwen3-VL
A New Axis of Sparsity for Large Language Models
Z80-μLM is a 2-bit quantized language model
Simplifies the local serving of AI models from any source
Language Model Reinforcement Learning Environments frameworks
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Build a machine learning model from a prompt
LLM training in simple, raw C/CUDA
Less Code, Lower Barrier, Faster Deployment
A simple, secure MCP-to-OpenAPI proxy server
Implementation of "MobileCLIP" CVPR 2024
Code release for Cut and Learn for Unsupervised Object Detection
Official implementation of Watermark Anything with Localized Messages
Training Large Language Model to Reason in a Continuous Latent Space
Video understanding codebase from FAIR for reproducing video models
Towards Real-World Vision-Language Understanding
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Superduper: Integrate AI models and machine learning workflows
Scalable data pre processing and curation toolkit for LLMs
Turns Data and AI algorithms into production-ready web applications
Python Stream Processing