Director, Screenwriter, Producer, and Video Generator All-in-One
LTX-Video Support for ComfyUI
A state-of-the-art open visual language model
Official SeedVR2 Video Upscaler for ComfyUI
A simple tool for reading in poorly redacted documents
The most powerful Android RPA agent framework
Agent-ready RPA suite with visual workflow automation tools engine
A Grub Theme in the style of Minecraft!
Official Python inference and LoRA trainer package
A framework to enable multimodal models to operate a computer
Tiny vision language model
SAPIEN Manipulation Skill Framework
Recovering the Visual Space from Any Views
Parse files for optimal RAG
Machine Learning, Criticism and Correction
Turn WiFi signals into real-time human pose estimation and detection
StarVector is a foundation model for SVG generation
Unified Multimodal Understanding and Generation Models
Book_4_Matrix Power | The Iris Book: From Addition, Subtraction
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Effortless data labeling with AI support from Segment Anything
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Extension of Google Research’s PaperBanana
A neural network that transforms a design mock-up into static websites
"VideoRAG: Chat with Your Videos