Run Local LLMs on Any Device. Open-source
Openai style api for open large language models
LLM.swift is a simple and readable library
A high-performance ML model serving framework, offers dynamic batching
A RWKV management and startup tool, full automation, only 8MB
Tensor search for humans
An easy-to-use LLMs quantization package with user-friendly apis
A real time inference engine for temporal logical specifications
Database system for building simpler and faster AI-powered application
A computer vision framework to create and deploy apps in minutes
LLMFlows - Simple, Explicit and Transparent LLM Apps
CPU/GPU inference server for Hugging Face transformer models