Access large language models from the command-line
NeurIPS2025 Spotlight] Quantized Attention
Production-grade platform for building agentic IM bots
One-stop solution for creating your digital avatar from chat history
Code for the paper "Evaluating Large Language Models Trained on Code"
Qwen3-omni is a natively end-to-end, omni-modal LLM
Operating LLMs in production
Ongoing research training transformer models at scale
Gracefully face hCaptcha challenge with multimodal llms
Language-model investigation agent with a terminal UI
Capable of understanding text, audio, vision, video
LangChain powered shell command generator and runner CLI
MemoryOS is designed to provide a memory operating system
Unleashing 10,000+ Word Generation from Long Context LLMs
A lightweight framework for building LLM-based agents
A dataset consists of 15,140 ChatGPT prompts from Reddit
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
The official implementation of RAPTOR
A simple, easy-to-hack GraphRAG implementation
A frontier, first-principles handbook
Replace OpenAI GPT with another LLM in your app
Unified KV Cache Compression Methods for Auto-Regressive Models
Take control of your AI agents
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
High-performance Inference and Deployment Toolkit for LLMs and VLMs