This repo contains the code for 1D tokenizer and generator
Industry leading face manipulation platform
AI Fully Automated Short Video Engine
Visual Causal Flow
Open source multimodal creative AI assistant with infinite canvas tool
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
AI tool that removes hardcoded subtitles and text from videos locally
Open source demo platform where you can easily showcase your AI models
LTX-Video Support for ComfyUI
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
A Pioneering Open-Source Alternative to GPT-4o
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Automated translation solution for visual novels
Director, Screenwriter, Producer, and Video Generator All-in-One
A state-of-the-art open visual language model
Witness the aha moment of VLM with less than $3
Skywork-R1V is an advanced multimodal AI model series
Code for running inference and finetuning with SAM 3 model
Official Python inference and LoRA trainer package
A framework to enable multimodal models to operate a computer
Tiny vision language model
Agent-ready RPA suite with visual workflow automation tools engine
Visual tool for building, testing, and deploying AI agent workflows
Recovering the Visual Space from Any Views
Mixture-of-Experts Vision-Language Models for Advanced Multimodal