Vision Agents is an open-source Python framework for building real-time voice and video AI agents. It is designed for applications that need to watch, listen, understand, and respond with very low latency. The framework can combine vision models, speech models, LLMs, and real-time transport providers into one agent workflow. It supports use cases such as live coaching, telehealth, customer support, security monitoring, interactive video assistants, and voice-controlled tools. Vision Agents is model-agnostic, so developers can connect providers such as OpenAI, Gemini, Claude, Hugging Face, YOLO, Roboflow, and others. Its main value is giving developers a flexible foundation for multimodal agents that operate on live audio and video instead of only static prompts.

Features

  • Real-time voice and video AI agents
  • Python framework for multimodal workflows
  • Model-agnostic provider integration
  • WebRTC and edge-network support
  • Vision, speech, and LLM pipelines
  • Low-latency interactive agent design

Project Samples

Project Activity

See All Activity >

Categories

Agentic AI

License

Apache License V2.0

Follow Vision Agents

Vision Agents Web Site

Other Useful Business Software
$300 Free Credits for Your Google Cloud Projects Icon
$300 Free Credits for Your Google Cloud Projects

Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
Start Free Trial
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Vision Agents!

Additional Project Details

Programming Language

Python

Related Categories

Python Agentic AI Tool

Registered

2026-06-09