AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Build Vision Agents quickly with any model or video provider
Tool made to launch the popular Game Trainer / Cheat tool
ViewBot using requests, updated 2025
Time-lapse Video Generation Models as Metamorphic Simulators
Video-based AI memory library. Store millions of text chunks in MP4
VideoRAG: Chat with Your Videos
Trying to be a robust, user-friendly and hackable music player
Let's make video diffusion practical
The music player of today
A simple Python Pydantic model for Honkai
Capable of understanding text, audio, vision, video
Generate blog articles from video or audio
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Official Repo For Sa2VA: Marrying SAM2 with LLaVA
Persepolis Download Manager is a GUI for aria2
Taming Stable Diffusion for Lip Sync
GPT4V-level open-source multi-modal model based on Llama3-8B
Douyin TikTok Download API
The Shiptest Codebase
Music player and music library manager for Linux, Windows, and macOS
Advancing Open-source World Models
Official MiniMax Model Context Protocol (MCP) server
Image polygonal annotation with Python
Harmonized and Coherent Human Image Animation