Openai style api for open large language models
The unofficial python package that returns response of Google Bard
Run Local LLMs on Any Device. Open-source
Port of OpenAI's Whisper model in C/C++
The free, Open Source alternative to OpenAI, Claude and others
Easiest and laziest way for building multi-agent LLMs applications
Low-latency REST API for serving text-embeddings
User-friendly AI Interface
Unofficial (Golang) Go bindings for the Hugging Face Inference API
Optimizing inference proxy for LLMs
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Replace OpenAI GPT with another LLM in your app
A library for accelerating Transformer models on NVIDIA GPUs
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Private Open AI on Kubernetes
Operating LLMs in production
A RWKV management and startup tool, full automation, only 8MB
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
OpenAI swift async text to image for SwiftUI app using OpenAI
Large Language Model Text Generation Inference
Simplifies the local serving of AI models from any source
Data manipulation and transformation for audio signal processing
A GPU-accelerated library containing highly optimized building blocks
A high-performance ML model serving framework, offers dynamic batching
Unified Model Serving Framework