Image/video AI upscaler app (BSRGAN)
3D reconstruction software
OCRmyPDF adds an OCR text layer to scanned PDF files
InvokeAI is a leading creative engine for Stable Diffusion models
Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
A gradio web UI for running Large Language Models like LLaMA
Open source personal AI Assistant for Linux, Windows and Mac
Label Studio is a multi-type data labeling and annotation tool
A community-supported supercharged version of paperless
Stable Diffusion built-in to Blender
Implementation of Make-A-Video, new SOTA text to video generator
An unsupervised and free tool for image and video dataset analysis
Fast image augmentation library and an easy-to-use wrapper
Implementation of Phenaki Video, which uses Mask GIT
Chat-based assistant that understands tasks
A walk along memory lane
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Easily turn large sets of image urls to an image dataset
The ultimate tool to automate custom telegram message forwarding
A Python library for turning text quotes into graphical images
Images to inference with no labeling
High quality, fast, modular reference implementation of SSD in PyTorch
Visual localization made easy with hloc
A Telegram RSS bot that cares about your reading experience
Algorithms for outlier, adversarial and drift detection