Build Vision Agents quickly with any model or video provider
Real time face swap and one-click video deepfake
A nearly-live implementation of OpenAI's Whisper
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Visual intelligence for your home.
Document Image Parsing via Heterogeneous Anchor Prompting”
NVR with realtime local object detection for IP cameras
Framework for building real-time voice and multimodal AI agents
Data Lake for Deep Learning. Build, manage, and query datasets
OpenFieldAI is an AI based Open Field Test Rodent Tracker
A computer vision framework to create and deploy apps in minutes
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Telegram Group Calls Streaming bot with some useful features
Telegram bot to stream videos in telegram voicechat for both groups
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
World's simplest facial recognition api for Python & the command line