Lorax is a multi-LoRA (Low-Rank Adaptation) inference server that scales to thousands of fine-tuned Large Language Models (LLMs). It enables efficient deployment and management of numerous fine-tuned models, facilitating scalable AI applications. Lorax is designed to handle high concurrency and provides a robust infrastructure for serving multiple LLMs simultaneously.
Features
- Multi-LoRA inference server
- Scales to thousands of fine-tuned LLMs
- Efficient deployment of multiple models
Categories
LLM InferenceLicense
Apache License V2.0Follow LoRAX
Other Useful Business Software
Fully Managed MySQL, PostgreSQL, and SQL Server
Cloud SQL handles your database ops end to end, so you can focus on your app.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of LoRAX!