AV1 Image File Format Specification - ISO-BMFF/HEIF derivative
Let's make video diffusion practical
This repo contains the code for a 1D tokenizer and generator
A lightweight vision library for performing large object detection
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
A real-time visualisation of the CO2 emissions of electricity
MII makes low-latency and high-throughput inference possible
A Customizable Image-to-Video Model based on HunyuanVideo
Overcoming Data Limitations for High-Quality Video Diffusion Models
Stereo Photo Manipulation
An advanced file manager with QSS themes and ISO and folder previews
An open-source framework for training large multimodal models
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Insane(ly slow but wicked good) PNG image optimization
A version of the AI art creation software based on Disco Diffusion
A Unified Toolkit for Deep Learning Based Document Image Analysis
Implementation of Deep Feature Rotation for Multimodal Image
Production-ready punctuation restoration model for the Russian language
Generate a preview gallery for your LUTs.
Fast, accurate and stable 3D dense face alignment
Composable GAN framework with an API and user interface
Provides a random wallpaper from webcams, saved images, or both!
Compute FID scores with PyTorch