Director, Screenwriter, Producer, and Video Generator All-in-One
Swing Music is a beautiful, self-hosted music player
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Multimodal Diffusion with Representation Alignment
Voice Recognition to Text Tool
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Ffree local self hosted video compressor webui
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Python data, Leaflet.js maps
The Ren'Py Visual Novel Engine
An unsupervised and free tool for image and video dataset analysis
The most powerful and modular diffusion model GUI, api and backend
Open source terminal session recorder
Implementation of a U-net complete with efficient attention
PyTorch code and models for VJEPA2 self-supervised learning from video
Public opinion analysis system
Python Socket.IO server and client
NBA Stats API via Basketball Reference
Lightweight Python library for adding real-time multi-object tracking
Official code for StoryMem: Multi-shot Long Video Storytelling
Streaming Real-time Audio-Driven Avatar Generation
Code for running inference and finetuning with SAM 3 model
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
Private chat with local GPT with document, images, video, etc.
Convert various image, audio and video formats from your context menu.