Lucebox

Fast LLM speculative inference server for consumer hardware

This is an exact mirror of the Lucebox project, hosted at https://github.com/Luce-Org/lucebox-hub. SourceForge is not affiliated with Lucebox.

Downloads: 0 This Week

Last Update: 21 hours ago

Get an email when there's a new version of Lucebox

Windows Mac Linux BSD ChromeOS

Lucebox is a local LLM inference server built for fast generation on consumer hardware. It focuses on custom kernels, speculative prefill, speculative decoding, and model-specific optimizations rather than a generic one-size-fits-all runtime. The project includes a native C++ HTTP server with an OpenAI-compatible API, making it usable with tools that already speak the Chat Completions format. It supports CUDA and ROCm workflows, with Docker images for NVIDIA and AMD GPU setups. The repository also includes harnesses for testing compatibility with clients such as Claude Code, Codex, OpenCode, Hermes, Pi, OpenClaw, and Open WebUI. It is most useful for developers and AI enthusiasts who want to run optimized local models with lower latency, faster token generation, and hardware-aware inference behavior.

Features

Local LLM inference server
OpenAI-compatible HTTP API
Speculative prefill and decoding
CUDA and ROCm GPU support
Docker deployment workflows
Client harness compatibility testing

Project Samples

Lucebox Screenshot 1

Lucebox Screenshot 2

Project Activity

See All Activity >

Categories

Artificial Intelligence

License

Apache License V2.0

Follow Lucebox

Lucebox Web Site

Other Useful Business Software

Atera - an All-in-one platform for IT management Icon

Atera - an All-in-one platform for IT management

Ideal for IT departments and MSPs (managed service providers)

Your IT essentials, integrated & elevated. Take your IT management from automated to autonomous, download Atera's agent to start your free trial!

Try Atera now

Rate This Project

Login To Rate This Project

User Reviews

Be the first to post a review of Lucebox!

Additional Project Details

Programming Language

Related Categories

C++ Artificial Intelligence Software

Registered

21 hours ago

Report inappropriate content

Ship Agents Faster

Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free

Recommended Projects

7-Zip
A free file archiver for extremely high compression
Clonezilla
A partition and disk imaging/cloning program
Apache OpenOffice
The free and Open Source productivity suite
KeePass
A lightweight and easy-to-use password manager
DeSmuME
DeSmuME is a Nintendo DS emulator