Real-time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
Large language model for live-stream sales anchors
A nearly-live implementation of OpenAI's Whisper
A robust, efficient, low-latency speech-to-text library
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
NVR with realtime local object detection for IP cameras
Visual intelligence for your home.
Build Vision Agents quickly with any model or video provider
Virtual AI anchor that combines state-of-the-art technology
Python & JS/TS SDK for running AI-generated code
Ready-to-run cloud templates for RAG
Self-learning data agent that grounds its answers in layers of content
Document Image Parsing via Heterogeneous Anchor Prompting
Framework for building real-time voice and multimodal AI agents
Open-Source Financial Large Language Models
A text-to-speech, speech-to-text and speech-to-speech library
Qwen3-ASR is an open-source series of ASR models
AI Agent Evaluator & Red Team Platform
Python chatbot framework with Natural Language Understanding
Framework for building realtime multimodal voice AI agent apps
LinkedIn Automation Tool
Code to accompany "A Method for Animating Children's Drawings"
DeepMind model for tracking arbitrary points across videos & robotics