Operating LLMs in production
Deploy your agentic workflows to production
Replace OpenAI GPT with another LLM in your app
A secure, low-code honeypot framework
Helps developers deploy LangChain runnables and chains as a REST API
Personal AI Notebooks. Organize files & webpages and generate notes
Low-latency REST API for serving text embeddings
C#/.NET binding of llama.cpp, including LLaMA/GPT model inference
A guidance language for controlling large language models
Cybersecurity AI (CAI), the framework for AI Security
ChatGLM2-6B: An Open Bilingual Chat LLM
High-performance inference framework for large language models
Performance-optimized AI inference on your GPUs
Production-grade platform for building agentic IM bots
On the Structural Pruning of Large Language Models
Implement CPU from scratch and play with large model deployments
Open source and self-hostable browser automation library for AI agents
Research papers and blogs to transition to AI Engineering
Unified interface for AI chat, agentic workflows and more
Self-hosted AI accounting app. LLM analyzer for receipts
Cloud-native runtime for agentic AI
Application implementation with business use cases
An open-source, code-first Java toolkit
Open-source LLM load balancer and serving platform for hosting LLMs
A simple, performant and scalable JAX LLM