Real time face swap and one-click video deepfake
Deploy your private Gemini application for free with one click
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Workflow and speech recognition app
Document Image Parsing via Heterogeneous Anchor Prompting”
AI-Powered Photos App for the Decentralized Web 🌈💎✨
A computer vision closed-loop learning platform
Open source framework for deep learning satellite and aerial imagery
Powerful open source image generation model
Virtual AI anchor that combines state-of-the-art technology
Visual Automation IDE — automate anything you see on screen
An AI assistant for everyone, powered by the Qwen series models
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Guide to deploying deep-learning inference networks
Explore how machine learning works, live in the browser
Nodejs bindings to OpenCV 3 and OpenCV 4
Nash Operating System for Modern Ecommerce
World's simplest facial recognition api for Python & the command line