This repo contains the code for the 1D tokenizer and generator
Industry-leading face manipulation platform
AI Fully Automated Short Video Engine
Visual Causal Flow
Director, Screenwriter, Producer, and Video Generator All-in-One
AI tool that removes hardcoded subtitles and text from videos locally
Open-source demo platform where you can easily showcase your AI models
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
LTX-Video Support for ComfyUI
Witness the aha moment of VLM with less than $3
Tiny vision language model
Skywork-R1V is an advanced multimodal AI model series
Unified Multimodal Understanding and Generation Models
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
Code for running inference and finetuning with SAM 3 model
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Automated translation solution for visual novels
Open-source multimodal creative AI assistant with infinite canvas tool
A framework to enable multimodal models to operate a computer
A state-of-the-art open visual language model
Visual tool for building, testing, and deploying AI agent workflows
Suite of reference architectures for building GPU-accelerated vision
An extensive node suite that enables ComfyUI to process 3D inputs
StarVector is a foundation model for SVG generation
Official Python inference and LoRA trainer package