Real-time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
Large language model for live-stream sales hosts
A nearly-live implementation of OpenAI's Whisper
A robust, efficient, low-latency speech-to-text library
NVR with realtime local object detection for IP cameras
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
Python & JS/TS SDK for running AI-generated code
Build Vision Agents quickly with any model or video provider
A meta-harness for all your AI agents
Open Vision Agents by Stream. Build voice and vision agents quickly
Visual intelligence for your home.
Ready-to-run cloud templates for RAG
Self-learning data agent that grounds its answers in layers of content
Document Image Parsing via Heterogeneous Anchor Prompting
LinkedIn Automation Tool
Framework for building realtime multimodal voice AI agent apps
Framework for building real-time voice and multimodal AI agents
Open-Source Financial Large Language Models
AI Agent Evaluator & Red Team Platform
DeepMind model for tracking arbitrary points across videos & robotics
Qwen3-ASR is an open-source series of ASR models
Analyzing Hacker News discussions from a decade ago in hindsight
Python chatbot framework with Natural Language Understanding