Visual Automation IDE — automate anything you see on screen
Plug-n-play module turning text-to-image models into animation
dashAI: an interactive platform for training, evaluating and deploying
Visual Instruction Tuning: Large Language-and-Vision Assistant
computer vision projects | Fun AI projects related to computer vision
Guiding Instruction-based Image Editing via Multimodal Large Language
Open-source tool to visualise your RAG
CS2, Valorant, Fortnite, APEX, every game
Library of self-supervised methods for visual representation
Creation of a Taylorplot for several machine learning models
Official code for Style Aligned Image Generation via Shared Attention
Consistency Distilled Diff VAE
Implementation of Nougat Neural Optical Understanding
Visual localization made easy with hloc
Task-oriented finetuning for better embeddings on neural search
Enable sending and receiving images during chatting
A latent text-to-image diffusion model
Implementation of BEVFormer, a camera-only framework
Visual analysis and diagnostic tools to facilitate ML selection
Code release for ConvNeXt model
Machine learning glossary
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Generative Adversarial Transformers