Code for running inference and finetuning with SAM 3 model
LTX-Video Support for ComfyUI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Start directing AI agents
Open source MVVM framework for Web Apps
Low-code app builder for RAG and multi-agent AI applications
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Annotate and review coding agent plans visually, share with your team
Open Source AI Automation
Agent S: an open agentic framework that uses computers like a human
Extensible workflow development framework
Turns Data and AI algorithms into production-ready web applications
Recovering the Visual Space from Any Views
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Python inference and LoRA trainer package for the LTX-2 audio–video
Lets make video diffusion practical
Full-stack AI Red Teaming platform
Pretty diff to html javascript library (diff2html)
Just a Better Chatbot. Powered by MCP Client & Workflows
AI Product Design Agent
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Taming Stable Diffusion for Lip Sync
Lightning fast C++/CUDA neural network framework
"Big Model" trains a visual multimodal VLM with 26M parameters