vLLM is a fast and easy-to-use library for LLM inference and serving. It delivers high-throughput serving with support for various decoding algorithms, including parallel sampling, beam search, and more.
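To make the description concrete, here is a minimal offline-inference sketch using vLLM's Python API. The model name and sampling settings are illustrative assumptions; exact parameter names may vary slightly between vLLM versions.

```python
# Minimal sketch of offline batched generation with vLLM.
# Model ID and sampling settings are illustrative, not prescriptive.
from vllm import LLM, SamplingParams

prompts = [
    "The capital of France is",
    "Explain PagedAttention in one sentence:",
]

# n=2 requests parallel sampling: two completions per prompt.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, n=2, max_tokens=64)

llm = LLM(model="facebook/opt-125m")  # any supported HuggingFace model ID
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    for completion in output.outputs:
        print(output.prompt, "->", completion.text)
```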
Features
- State-of-the-art serving throughput
- Efficient management of attention key and value memory with PagedAttention
- Continuous batching of incoming requests
- Optimized CUDA kernels
- Seamless integration with popular HuggingFace models
- Tensor parallelism support for distributed inference
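As a rough sketch of how these features surface in practice, the snippet below shards a model across multiple GPUs with tensor parallelism and submits a batch of prompts in one call; PagedAttention and continuous batching are applied internally by the engine. The model name and GPU count are assumptions for illustration only.

```python
# Hedged sketch: multi-GPU inference with tensor parallelism.
# Assumes 2 GPUs are available and the model fits when sharded across them.
from vllm import LLM, SamplingParams

llm = LLM(
    model="meta-llama/Llama-2-7b-hf",  # illustrative HuggingFace model ID
    tensor_parallel_size=2,            # shard weights across 2 GPUs
)

params = SamplingParams(temperature=0.0, max_tokens=32)

# A single call can carry many requests; vLLM batches them continuously
# and manages KV-cache memory with PagedAttention under the hood.
outputs = llm.generate(["Summarize vLLM in one line:"] * 8, params)
for out in outputs:
    print(out.outputs[0].text)
```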
Categories
Large Language Models (LLM)

License
Apache License V2.0