ONNX Runtime: cross-platform, high performance ML inferencing
Rust async runtime based on io-uring
TTS with kokoro and onnx runtime
LiteRT is the new name for TensorFlow Lite (TFLite)
ByteHook is an Android PLT hook library
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
MLX: An array framework for Apple silicon
Port of OpenAI's Whisper model in C/C++
Tools like web browser, computer access and code runner for LLMs
OpenVINO™ Toolkit repository
Deep learning at the speed of light
A retargetable MLIR-based machine learning compiler runtime toolkit
Port of Facebook's LLaMA model in C/C++
A self-hostable CDN for databases
Open source solution that can meet the requirements of workloads
NVIDIA Federated Learning Application Runtime Environment
Build your own Cowork, AI Scientist and other SoTA Agents
Desktop Agent for Any Task
SGLang is a fast serving framework for large language models
Build AI-powered applications with React, Svelte, Vue, and Solid
Run Stable Diffusion on Mac natively
Deploy and share agents with open infrastructure
The most reliable AI agent framework that supports MCP
Secure open source cloud runtime for AI apps & AI agents
Open source codebase for Scale Agentex