Real-time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
Large language model for live-stream sales anchors
A nearly-live implementation of OpenAI's Whisper
A robust, efficient, low-latency speech-to-text library
The open-source data curation platform for LLMs
Open Vision Agents by Stream. Build voice and vision agents quickly
Build Vision Agents quickly with any model or video provider
Visual intelligence for your home
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Python & JS/TS SDK for running AI-generated code
NVR with realtime local object detection for IP cameras
Ready-to-run cloud templates for RAG
Document Image Parsing via Heterogeneous Anchor Prompting
Python chatbot framework with Natural Language Understanding
Self-learning data agent that grounds its answers in layers of content
EPUB to audiobook converter, optimized for Audiobookshelf
AI Agent Evaluator & Red Team Platform
Data science on data without acquiring a copy
Framework for building real-time voice and multimodal AI agents
A text-to-speech, speech-to-text and speech-to-speech library
Framework for building real-time multimodal voice AI agent apps
LinkedIn Automation Tool
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Anthropic's educational courses