An LLM Compiler for Parallel Function Calling
A high-throughput and memory-efficient inference and serving engine
A Simple and Universal Swarm Intelligence Engine
Ongoing research training transformer models at scale
Large-language-model & vision-language-model based on Linear Attention
A Frontier Mathematical Coding Agent
Making large AI models cheaper, faster and more accessible
Parallax is a distributed model serving framework
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Stanford NLP Python library for many human languages
95% on SimpleQA (e.g. Qwen3.6-27B on a 3090)
A state-of-the-art open visual language model
Development repository for the Triton language and compiler
Block Diffusion for Ultra-Fast Speculative Decoding
Language Model Reinforcement Learning Environments frameworks
Powerful framework for controlling Android and iOS devices
Seamlessly integrate LLMs as Python functions
The official repository for ERNIE 4.5 and ERNIEKit
Edit videos with Claude Code
A Python library for extracting structured information
FAIR Sequence Modeling Toolkit 2
Build production-ready AI agents in both Python and Typescript
A software construction tool
Fault-tolerant, highly scalable GPU orchestration