Python Computer Vision & Video Analytics Framework With Batteries Incl
Vision-language-action model for robot control via images and text
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
Small 3B-base multimodal model ideal for custom AI on edge hardware
A Highly Detailed 4x Game Inspired by Caveman2Cosmos
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video
Enabling Green Video Streaming Over Internet of Things
Metric monocular depth estimation (vision model)
Privacy Platform | Encrypt Locally | Secure Remotely
Offline AI orchestration with a modern UI & model integration