Helps developers deploy LangChain runnables and chains as a REST API
Ready-to-run cloud templates for RAG
Jupyter notebook tutorials for OpenVINO
Running large language models on a single GPU
Low-latency REST API for serving text-embeddings
NLP Cloud serves high performance pre-trained or custom models
Tribuo - A Java machine learning library
Probabilistic reasoning and statistical analysis in TensorFlow
Host Agent for AWS CodeDeploy
Open source platform for the machine learning lifecycle
TFX is an end-to-end platform for deploying production ML pipelines
NLP Cloud serves high performance pre-trained or custom models for NER
Run serverless GPU workloads with fast cold starts on bare-metal
Deep Research framework, combining language models with tools
Open platform for sharing and discovering Stable Diffusion models
Official inference library for Mistral models
Desktop app that provides a graphical interface for OpenClaw AI
High-performance neural network inference framework for mobile
An AI agent development platform with all-in-one visual tools
Open-Source AI Camera. Empower any camera/CCTV
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
A guidance language for controlling large language models
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL
A model-agnostic Ruby Generative AI DSL and framework
Pruna is a model optimization framework built for developers