Openai style api for open large language models
Optimizing inference proxy for LLMs
Easiest and laziest way for building multi-agent LLMs applications
Replace OpenAI GPT with another LLM in your app
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Implementation of "Tree of Thoughts