A lightweight vision library for performing large object detection
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
An open source implementation of CLIP
Official Python inference and LoRA trainer package
A Pioneering Open-Source Alternative to GPT-4o
Lets make video diffusion practical
Diffusion Transformer with Fine-Grained Chinese Understanding
RGBD video generation model conditioned on camera input
Automated nginx proxy for Docker containers using docker-gen
Implementation of a U-net complete with efficient attention
Multi-user UI for managing and running Stable Diffusion workflows tool
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
Contexts Optical Compression
Arch Linux installer - guided, templates etc.
Multilingual sentence & image embeddings with BERT
Code for running inference and finetuning with SAM 3 model
Dealing with all unstructured data, such as reverse image search
Recovering the Visual Space from Any Views
A Unified Framework for Image Customization
Tensor search for humans
Blender addons to make the bridge between Blender and geographic data
Fast-stable-diffusion + DreamBooth
Official implementation of Watermark Anything with Localized Messages
Personalize Any Characters with a Scalable Diffusion Transformer
Sharp Monocular Metric Depth in Less Than a Second