Why use many token when few token do trick
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
End-to-end protocol replay toolkit for ChatGPT Plus/Team/Pro sub
State-of-the-art Parameter-Efficient Fine-Tuning
MCP server enabling AI agents to control and automate Windows OS
Framework for building, orchestrating, and deploying AI agents
A PyTorch-based Speech Toolkit
Modular AI runtime for robots
Persistent context and multi-instance coordination
Lemonade helps users run local LLMs with the highest performance
Ready-to-use OCR with 80+ supported languages
A simple yet powerful agent framework for personal assistants
MobileLLM Optimizing Sub-billion Parameter Language Models
MTEB: Massive Text Embedding Benchmark
JAX-based neural network library
State-of-the-art TTS model under 25MB
SOTA Open Source TTS
State-of-the-art (SoTA) text-to-video pre-trained model
Reference PyTorch implementation and models for DINOv3
Official inference repo for FLUX.2 models
The AI toolkit for the AI developer
Build your own Cowork, AI Scientist and other SoTA Agents
Context management for Claude Code. Hooks maintain state via ledgers
A Systematic Framework for Interactive World Modeling
Educational framework exploring multi-agent orchestration