This repo contains the code for 1D tokenizer and generator
Local Lambda debug, CodeWhisperer, SAM/CFN syntax, etc.
Industry leading face manipulation platform
Director, Screenwriter, Producer, and Video Generator All-in-One
AI Fully Automated Short Video Engine
Visual Causal Flow
Witness the aha moment of VLM with less than $3
LTX-Video Support for ComfyUI
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)
AI tool that removes hardcoded subtitles and text from videos locally
Code for running inference and finetuning with SAM 3 model
Automated translation solution for visual novels
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Open source demo platform where you can easily showcase your AI models
Visual tool for building, testing, and deploying AI agent workflows
Open source multimodal creative AI assistant with infinite canvas tool
Free and source-available fair-code licensed workflow automation tool
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Official Python inference and LoRA trainer package
Skywork-R1V is an advanced multimodal AI model series
Visual intelligence for your home.
Tiny vision language model
Data manipulation and transformation for audio signal processing
A state-of-the-art open visual language model
A framework to enable multimodal models to operate a computer