Implementation of RQ Transformer, which proposes a more efficient way of training multi-dimensional sequences autoregressively. This repository will only contain the transformer for now. You can use this vector quantization library for the residual VQ.

This type of axial autoregressive transformer should be compatible with memcodes, proposed in NWT. It would likely also work well with multi-headed VQ.

I also think there is something deeper going on, and have generalized this to any number of dimensions. You can use it by importing the HierarchicalCausalTransformer, as in the sketch below.

For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ) represents an image as a sequence of discrete codes. A short sequence length is important for an AR model to reduce the computational cost of considering long-range interactions of codes. However, the paper postulates that previous VQ methods cannot both shorten the code sequence and generate high-fidelity images, in terms of the rate-distortion trade-off.
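As a rough usage sketch for the generalized module named above: the constructor arguments shown here (per-axis `depth` and `max_seq_len`, `num_tokens`, `dim`, attention settings) and the `return_loss` keyword are assumptions about the API rather than a documented signature, so check the source for the exact names.

```python
import torch
from rq_transformer import HierarchicalCausalTransformer

# Assumed constructor arguments -- verify against the actual source.
model = HierarchicalCausalTransformer(
    num_tokens = 16000,        # codebook / vocabulary size
    dim = 512,                 # model dimension
    heads = 8,                 # attention heads
    dim_head = 64,             # dimension per attention head
    depth = (4, 4, 2),         # transformer layers per axis, outermost to innermost
    max_seq_len = (16, 8, 4)   # maximum sequence length per axis
)

# a batch of discrete codes, one token per position of the 3-dimensional grid
x = torch.randint(0, 16000, (1, 16, 8, 4))

loss = model(x, return_loss = True)   # assumed training interface
loss.backward()
```

A two-axis configuration (spatial positions by residual quantization depth) would recover the RQ-Transformer setting described in the paper.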
## Features
- RQ-Transformer can efficiently reduce the computational cost of computing long-range interactions of codes
- Outperforms the existing AR models on various benchmarks of unconditional and conditional image generation
- RQ-Transformer learns to predict the quantized feature vector at the next position by predicting the next stack of codes
- Effectively generates high-resolution images
- RQ-VAE can precisely approximate a feature map of an image and represent the image as a stacked map of discrete codes (see the sketch after this list)
- Implements the approach of the paper Autoregressive Image Generation using Residual Quantization
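To make the "stacked map of discrete codes" concrete, here is a minimal, self-contained sketch of residual quantization with a single shared codebook. It is only an illustration of the idea, not the repository's RQ-VAE; the function `residual_quantize`, the greedy nearest-neighbor assignment, and the toy shapes are assumptions for the example.

```python
import torch

def residual_quantize(feature, codebook, depth = 4):
    """
    Approximate each feature vector by a sum of `depth` codebook vectors,
    chosen greedily on the remaining residual at each step.
    feature:  (n, d) feature vectors
    codebook: (k, d) shared codebook
    returns:  (n, depth) code indices and the (n, d) reconstruction
    """
    residual = feature
    recon = torch.zeros_like(feature)
    codes = []

    for _ in range(depth):
        # nearest codebook entry for the current residual
        dists = torch.cdist(residual, codebook)   # (n, k) pairwise distances
        idx = dists.argmin(dim = -1)              # (n,) chosen code per vector
        quantized = codebook[idx]                 # (n, d) selected code vectors

        codes.append(idx)
        recon = recon + quantized
        residual = residual - quantized

    return torch.stack(codes, dim = -1), recon

# toy usage: 1024 spatial positions, 512-dim features, codebook of 16000 entries
feats = torch.randn(1024, 512)
codebook = torch.randn(16000, 512)
codes, recon = residual_quantize(feats, codebook)
print(codes.shape)   # torch.Size([1024, 4]) -- a stacked map of discrete codes
```

Each spatial position thus carries `depth` codes, and the RQ-Transformer consumes this (positions x depth) grid, predicting the next position's stack of codes one depth level at a time.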