C++ library for high performance inference on NVIDIA GPUs
Production-grade platform for building agentic IM bots
Performance-optimized AI inference on your GPUs
Easiest and laziest way for building multi-agent LLMs applications
lightweight, standalone C++ inference engine for Google's Gemma models
Deploy reasoning AI agents powered by agentic graph RAG in minutes
An open-source, code-first Java toolkit
Official inference framework for 1-bit LLMs
Voice Recognition to Text Tool
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Enterprise AI agent platform for workflows, models, and RAG apps
Follow along with my AI Agents Masterclass videos
A scalable inference server for models optimized with OpenVINO
Your Personal AI Assistant; easy to install, deploy on local or coud
Fast SQL-based BI tool for real-time dashboards and analytics
Framework for building AI-powered interactive digital humans and agent
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Build high-quality LLM apps
Private AI platform for agents, enterprise search and RAG pipelines
Question and Answer based on Anything
Framework for building, orchestrating, and deploying AI agents
Universal database MCP server connecting to MySQL, PostgreSQL
Streamline your ML workflow
The LLM API management & distribution system
Open source platform for managing, testing, and deploying AI apps