Built for demanding AI workflows
C++-based high-performance parallel environment execution engine
Lightning-fast, on-device TTS, running natively via ONNX
The Modular Platform (includes MAX & Mojo)
Java enterprise application development framework
Towards Human-Sounding Speech
Document content and metadata extraction microservice
Agent framework and applications built upon Qwen>=3.0
A high-performance inference engine for AI models
Official inference framework for 1-bit LLMs
Accurate × Fast × Comprehensive
AI gateway with token compression for Claude Code, Codex, and more
A AI-Driven, Distributed and high-performance monitoring system
TensorRT LLM provides users with an easy-to-use Python API
An efficient forwarding service designed for LLMs
A lightweight, lightning-fast, in-process vector database
Multi-agent autonomous startup system for Claude Code
OpenMLDB is an open-source machine learning database
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Open-Source Analytics Infrastructure
A simple, performant and scalable Jax LLM
High-performance Inference and Deployment Toolkit for LLMs and VLMs
slime is an LLM post-training framework for RL Scaling
A scalable inference server for models optimized with OpenVINO
The best ChatGPT that $100 can buy