Python inference and LoRA trainer package for the LTX-2 audio–video
Lets make video diffusion practical
AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Powerful open source team chat application
Video-based AI memory library. Store millions of text chunks in MP4
Capable of understanding text, audio, vision, video
The music player of today
A simple Python Pydantic model for Honkai
100–200× Acceleration for Video Diffusion Models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Official MiniMax Model Context Protocol (MCP) server
"VideoRAG: Chat with Your Videos
Time-lapse Video Generation Models as Metamorphic Simulators
ViewBot using requests updated 2025
Music player and music library manager for Linux, Windows, and macOS
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Trying to be a robust, user-friendly and hackable music player
Taming Stable Diffusion for Lip Sync
Multimodal Diffusion with Representation Alignment
Tool made to launch the popular Game Trainer / Cheat tool
Director, Screenwriter, Producer, and Video Generator All-in-One
Image polygonal annotation with Python
Generate blog articles from video or audio
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Voice Recognition to Text Tool