LLM Vision is an open-source integration for Home Assistant that adds multimodal large language model capabilities to smart home environments. The project enables Home Assistant to analyze images, video files, and live camera feeds using vision-capable AI models. Instead of relying only on traditional object detection pipelines, it allows users to send prompts about visual content and receive contextual descriptions or answers about what is happening in camera footage. The system can process events from surveillance platforms such as Frigate and convert them into meaningful summaries, notifications, or structured data for automation workflows. It also maintains a timeline of analyzed camera events that can be displayed in dashboards or queried through the assistant interface.

Features

  • Multimodal analysis of images, video files, and live camera streams
  • Integration with Home Assistant and surveillance tools such as Frigate
  • Natural language prompts to query visual events or camera snapshots
  • Timeline tracking of analyzed events for dashboards and history views
  • Memory capability to recognize people, pets, and objects across events
  • Support for multiple AI providers and OpenAI-compatible endpoints

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow LLM Vision

LLM Vision Web Site

Other Useful Business Software
Earn up to 16% annual interest with Nexo. Icon
Earn up to 16% annual interest with Nexo.

Access competitive interest rates on your digital assets.

Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
Get started with Nexo.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of LLM Vision!

Additional Project Details

Programming Language

Python

Related Categories

Python Large Language Models (LLM)

Registered

2026-03-09