Hollama is a lightweight open-source chat application designed to run entirely within the browser while interacting with large language model servers. The project provides a minimal but powerful user interface for communicating with local or remote LLMs, including servers powered by Ollama or OpenAI-compatible APIs. Because the application runs as a static web interface, it does not require complex backend infrastructure and can be easily deployed or self-hosted. Hollama supports both text-based and multimodal interactions, allowing users to work with models that process images as well as text. The interface includes features for editing prompts, retrying responses, copying generated code snippets, and storing conversation history locally within the browser. Mathematical expressions can be rendered using KaTeX, and Markdown formatting allows code blocks and structured outputs to appear clearly within conversations.
Features
- Browser-based LLM chat interface requiring no dedicated backend
- Support for Ollama and OpenAI compatible model servers
- Compatibility with both text and vision models
- Local storage of conversations directly within the browser
- Markdown rendering with syntax highlighting and KaTeX math support
- Customizable system prompts and advanced model parameters