min(DALL·E)

This is a fast, minimal port of Boris Dayma's DALL·E Mini (with mega weights). It has been stripped down for inference and converted to PyTorch. The only third-party dependencies are numpy, requests, pillow and torch. The required models will be downloaded to models_root if they are not already there. Set the dtype to torch.float16 to save GPU memory. If you have an Ampere architecture GPU you can use torch.bfloat16. Set the device to either cuda or "cpu". Once everything has finished initializing, call generate_image with some text as many times as you want. Use a positive seed for reproducible results. Higher values for supercondition_factor result in better agreement with the text but a narrower variety of generated images. Every image token is sampled from the top_k most probable tokens. The largest logit is subtracted from the logits to avoid infs. The logits are then divided by the temperature. If is_seamless is true, the image grid will be tiled in token space not pixel space.

Features

Generate a 3x3 grid of DALL·E Mega images
Save individual images
Progressive Outputs
Command Line
Fast, minimal port of Boris Dayma's DALL·E Mini
Stripped down for inference and converted to PyTorch

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow min(DALL·E)

min(DALL·E) Web Site

User Reviews

Be the first to post a review of min(DALL·E)!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Image Generators, Python Generative AI

Registered

2022-08-04

Similar Business Software

PyTorch

Transition seamlessly between eager and graph modes with TorchScript, and accelerate the path to production with TorchServe. Scalable distributed training and performance optimization in research and production is enabled by the torch-distributed backend. A rich ecosystem of tools and libraries...

See Software
DeepSpeed

DeepSpeed is an open source deep learning optimization library for PyTorch. It's designed to reduce computing power and memory use, and to train large distributed models with better parallelism on existing computer hardware. DeepSpeed is optimized for low latency, high throughput...

See Software
Groq

Groq is on a mission to set the standard for GenAI inference speed, helping real-time AI applications come to life today. An LPU inference engine, with LPU standing for Language Processing Unit, is a new type of end-to-end processing unit system that provides the fastest inference for...

See Software

Report inappropriate content

min(DALL·E)

min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch

Features

Project Samples

Project Activity

Categories

License

Follow min(DALL·E)

User Reviews

Additional Project Details

Programming Language

Related Categories

Registered