Operating LLMs in production
Deploy your agentic worfklows to production
Replace OpenAI GPT with another LLM in your app
A secure low code honeypot framework
Helps developers deploy LangChain runnables and chains as a REST API
Personal AI Notebooks. Organize files & webpages and generate notes
Low-latency REST API for serving text-embeddings
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
A guidance language for controlling large language models
Cybersecurity AI (CAI), the framework for AI Security
ChatGLM2-6B: An Open Bilingual Chat LLM
High-performance inference framework for large language models
Performance-optimized AI inference on your GPUs
Production-grade platform for building agentic IM bots
On the Structural Pruning of Large Language Models
Open source and self-hostable browser automation library for AI agents
Implement CPU from scratch and play with large model deployments
Research papers and blogs to transition to AI Engineering
Unified interface for AI chat, Agentic workflows and more
Self-hosted AI accounting app. LLM analyzer for receipts
Cloud-native runtime for agentic AI
Application implementation with business use cases
An open-source, code-first Java toolkit
Open-source LLM load balancer and serving platform for hosting LLMs
A simple, performant and scalable Jax LLM