Operating LLMs in production
An MLOps framework to package, deploy, monitor and manage models
Deploy your agentic worfklows to production
Training and deploying machine learning models on Amazon SageMaker
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
Jupyter notebook tutorials for OpenVINO
Replace OpenAI GPT with another LLM in your app
RF-DETR is a real-time object detection and segmentation
Ready-to-run cloud templates for RAG
Running large language models on a single GPU
Learn how to develop, deploy and iterate on production-grade ML
Low-latency REST API for serving text-embeddings
Probabilistic reasoning and statistical analysis in TensorFlow
Deep Research framework, combining language models with tools
Gen-AI Chat for Teams
Easiest and laziest way for building multi-agent LLMs applications
Deploy reasoning AI agents powered by agentic graph RAG in minutes
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL
Private AI platform for agents, enterprise search and RAG pipelines
Open source platform for the machine learning lifecycle
A guidance language for controlling large language models
Question and Answer based on Anything
Cybersecurity AI (CAI), the framework for AI Security
Official inference framework for 1-bit LLMs
Custom Chinese chatbot with Seq2Seq, GPT, and agent features