Real-time face swap and one-click video deepfake
Open source AI Agents hosted on the oTTomator Live Agent Studio
A nearly-live implementation of OpenAI's Whisper
LLM Large Model of Selling Anchor
A robust, efficient, low-latency speech-to-text library
Build Vision Agents quickly with any model or video provider
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The open-source data curation platform for LLMs
NVR with realtime local object detection for IP cameras
Visual intelligence for your home.
Ready-to-run cloud templates for RAG
Virtual AI anchor that combines state-of-the-art technology
Self-learning data agent that grounds its answers in layers of content
Document Image Parsing via Heterogeneous Anchor Prompting
Open-Source Financial Large Language Models
EPUB to audiobook converter, optimized for Audiobookshelf
Python & JS/TS SDK for running AI-generated code/code
Qwen3-ASR is an open-source series of ASR models
Framework for building realtime multimodal voice AI agent apps
Framework for building real-time voice and multimodal AI agents
LinkedIn Automation Tool
A text-to-speech, speech-to-text and speech-to-speech library
AI Agent Evaluator & Red Team Platform
Analyzing Hacker News discussions from a decade ago in hindsight
Python chatbot framework with Natural Language Understanding