This repo contains the code for a 1D tokenizer and generator
Industry-leading face manipulation platform
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
AI Fully Automated Short Video Engine
Director, Screenwriter, Producer, and Video Generator All-in-One
Visual Causal Flow
Witness the aha moment of VLM with less than $3
Code for running inference and finetuning with SAM 3 model
LTX-Video Support for ComfyUI
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
Automated translation solution for visual novels
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Open source demo platform where you can easily showcase your AI models
Visual tool for building, testing, and deploying AI agent workflows
Open source multimodal creative AI assistant with infinite canvas tool
Visual intelligence for your home
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Free and source-available fair-code licensed workflow automation tool
Official Python inference and LoRA trainer package
Skywork-R1V is an advanced multimodal AI model series
Tiny vision language model
Data manipulation and transformation for audio signal processing
A state-of-the-art open visual language model
A framework to enable multimodal models to operate a computer
Edit videos with Claude Code