API-for-Open-LLM is a lightweight API server designed for deploying and serving open large language models (LLMs), offering a simple way to integrate LLMs into applications.

Features

  • Provides a REST API for serving open LLMs
  • Supports multiple backends, including Hugging Face models
  • Enables GPU and CPU-based inference
  • Offers token streaming for real-time responses
  • Supports user authentication and request management
  • Open-source and customizable for different use cases

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow API-for-Open-LLM

API-for-Open-LLM Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of API-for-Open-LLM!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python

Related Categories

Python Natural Language Processing (NLP) Tool, Python LLM Inference Tool

Registered

2025-01-22