AI-powered video generation skill for OpenClaw
Open-Sora: Democratizing Efficient Video Production for All
RGBD video generation model conditioned on camera input
LTX-Video Support for ComfyUI
AI video generator optimized for low VRAM and older GPUs
Synchronized Translation for Videos
Time-lapse Video Generation Models as Metamorphic Simulators
Python inference and LoRA trainer package for the LTX-2 audio–video
Let's make video diffusion practical
100–200× Acceleration for Video Diffusion Models
AI-powered tool for generating, optimizing, and translating subtitles
Build Vision Agents quickly with any model or video provider
AI tool converting video/audio into structured documents instantly
Taming Stable Diffusion for Lip Sync
Generate blog articles from video or audio
Recovering the Visual Space from Any Views
GPT-4V-level open-source multi-modal model based on Llama3-8B
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Voice Recognition to Text Tool
AI Slack bot for reading, summarizing, and chatting with content
Public opinion analysis system
Effortless data labeling with AI support from Segment Anything
Unofficial Python API and agentic skill for Google NotebookLM
Benchmark LLMs by fighting in Street Fighter 3
Private chat with local GPT with documents, images, video, etc.