Python data, Leaflet.js maps
Dealing with all unstructured data, such as reverse image search
A Unified Framework for Image Customization
Tensor search for humans
Gracefully face hCaptcha challenge with multimodal llms
Official implementation of Watermark Anything with Localized Messages
Personalize Any Characters with a Scalable Diffusion Transformer
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Towards Real-World Vision-Language Understanding
Multiprocess Selenium crawler for downloading images by keywords
Implementation of 'lightweight' GAN, proposed in ICLR 2021
2D and 3D Face alignment library build using pytorch
Recovering the Visual Space from Any Views
Enables the best performance on NVIDIA RTX Graphics Cards
Arch Linux installer - guided, templates etc.
Multi-user UI for managing and running Stable Diffusion workflows tool
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Capable of understanding text, audio, vision, video
Expressive Portrait Image Animation for Live Streaming
Contexts Optical Compression
Sharp Monocular Metric Depth in Less Than a Second
Generate high-definition story short videos with one click using AI
A lightweight vision library for performing large object detection
AutoGluon: AutoML for Image, Text, and Tabular Data