Lightweight, standalone C++ inference engine for Google's Gemma models
Deploy reasoning AI agents powered by agentic graph RAG in minutes
Performance-optimized AI inference on your GPUs
An open-source, code-first Java toolkit
Official inference framework for 1-bit LLMs
Voice Recognition to Text Tool
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Enterprise AI agent platform for workflows, models, and RAG apps
Follow along with my AI Agents Masterclass videos
20+ high-performance LLMs with recipes to pretrain, finetune at scale
Build high-quality LLM apps
Your Personal AI Assistant; easy to install, deploy locally or in the cloud
Question and Answer based on Anything
Fast SQL-based BI tool for real-time dashboards and analytics
Framework for building AI-powered interactive digital humans and agents
Private AI platform for agents, enterprise search and RAG pipelines
Framework for building, orchestrating, and deploying AI agents
Universal database MCP server connecting to MySQL, PostgreSQL
Streamline your ML workflow
The LLM API management & distribution system
Open source platform for managing, testing, and deploying AI apps
Package and deploy machine learning models using Docker containers
Full-stack AI Red Teaming platform
A toolkit to optimize ML models for deployment for Keras & TensorFlow
One brain, many harnesses. Portable .agent/ folder