Unified Multimodal Understanding and Generation Models
Create architecture diagrams from code automatically using LLMs
Autoregressive Model Beats Diffusion
StarVector is a foundation model for SVG generation
This repository contains the official implementation of FastVLM
Visual editor for AI prompts with translation, categories, and tools
An open-source visual programming environment
A macOS menu bar application that monitors AI coding assistant usage
"VideoRAG: Chat with Your Videos
An LLM-based presentation generation platform
Recovering the Visual Space from Any Views
Edit videos with Claude Code
Wan2.1: Open and Advanced Large-Scale Video Generative Model
3D Computer Vision Framework
Weaving the Digital Agent Galaxy
The easy-to-use Vue low-code visual AI form designer
Visual intelligence for your home.
The first open-source agent skills builder
The visual feedback tool for agents
A workflow execution platform built on top of the fantastic Cloudflare
Doom-based AI research platform for reinforcement learning
Taming Stable Diffusion for Lip Sync
A modern model graph visualizer and debugger
Vision-based AI framework for cross-platform UI automation tasks
Python inference and LoRA trainer package for the LTX-2 audio–video