TTS with Kokoro and ONNX Runtime
AI agent harness for AI coding agents
Tools such as a web browser, computer access, and a code runner for LLMs
High-performance inference server for text embeddings models API layer
Chat with your documents using local AI
Our first fully AI-generated deep learning system
NVIDIA Federated Learning Application Runtime Environment
Deploy and share agents with open infrastructure
OpenSandbox is a general-purpose sandbox platform for AI applications
The Modular Platform (includes MAX & Mojo)
The most reliable AI agent framework that supports MCP
Elyra extends JupyterLab with an AI-centric approach
Universal LLM Deployment Engine with ML Compilation
Specify a GitHub or local repo, GitHub pull request
Implementation of "MobileCLIP" CVPR 2024
Operating LLMs in production
Build your own Cowork, AI Scientist and other SoTA Agents
Package and deploy machine learning models using Docker containers
SGLang is a fast serving framework for large language models
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Streamlines and simplifies prompt design for both developers
Self-learning data agent that grounds its answers in layers of content
Powering Amazon custom machine learning chips
Build, evaluate and train General Multi-Agent Assistance with ease
Offline text-to-speech synthesis for Python