Capable of understanding text, audio, vision, video
MII makes low-latency and high-throughput inference possible
Lightning fast C++/CUDA neural network framework
A Universal Customization Method for Single and Multi Conditioning
The data structure for multimodal data
Advancing Open-source World Models
Easy Docker setup for Stable Diffusion with user-friendly UI
Ready-to-run Docker images containing Jupyter applications
RGBD video generation model conditioned on camera input
Modern C++ Terminal Emulator
Geometric deep learning extension library for PyTorch
A python library for self-supervised learning on images
Gemma open-weight LLM library, from Google DeepMind
Instant neural graphics primitives: lightning fast NeRF and more
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
YOLOv5 is the world's most loved vision AI
Photo/Video/GIF enlargement using machine learning
mwayne's Dev Tests for PortableApps.com
A state-of-the-art open visual language model
A pipeline framework for developing video and image processing apps
Hardware Info for Linux portable AppImage + Benchmark
Image processing App for Windows Desktop
A Customizable Image-to-Video Model based on HunyuanVideo
High-performance image processing C++ application for Windows Desktop
AI Suite for upscaling, interpolating & restoring images/videos