From Images to High-Fidelity 3D Assets
Implementation of a U-net complete with efficient attention
Programs to process GoPro MP4 & Generic GPX/FIT files
Repo for SeedVR2 & SeedVR
Official inference repo for FLUX.2 models
The most powerful and modular diffusion model GUI, api and backend
AI-powered tool for generating, optimizing, and translating subtitles
AI framework for automated short video creation and editing tools
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
End-to-end pipeline converting generative videos
Motion-controllable Video Generation via Latent Trajectory Guidance
Video understanding codebase from FAIR for reproducing video models
Oobabooga - The definitive Web UI for local AI, with powerful features
Streaming Real-time Audio-Driven Avatar Generation
Uncommon Objects in 3D dataset
Foundation model for image generation
Convert AI papers to GUI
An open-source, ultra-low-latency remote desktop for Linux hosts
Official code for StoryMem: Multi-shot Long Video Storytelling
Lets make video diffusion practical
Blender Model Context Protocol Integration
Unofficial Python API and agentic skill for Google NotebookLM
Instill Core is a full-stack AI infrastructure tool for data
Advancing Open-source World Models
Data Infrastructure providing an approach to multimodal AI workloads