This repo contains the code for 1D tokenizer and generator
Industry leading face manipulation platform
AI Fully Automated Short Video Engine
Visual Causal Flow
Director, Screenwriter, Producer, and Video Generator All-in-One
Open image model at the forefront of design
AI tool that removes hardcoded subtitles and text from videos locally
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
Open source demo platform where you can easily showcase your AI models
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
An extensive node suite that enables ComfyUI to process 3D inputs
Visual intelligence for your home.
Open source multimodal creative AI assistant with infinite canvas tool
Visual tool for building, testing, and deploying AI agent workflows
LTX-Video Support for ComfyUI
Witness the aha moment of VLM with less than $3
Skywork-R1V is an advanced multimodal AI model series
Tiny vision language model
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Code for running inference and finetuning with SAM 3 model
Automated translation solution for visual novels
SAPIEN Manipulation Skill Framework
Parse files for optimal RAG
"VideoRAG: Chat with Your Videos
Official Python inference and LoRA trainer package