Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in Pytorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient unit design. It appears neither CLIP nor prior network is needed after all. And so research continues. For simpler training, you can directly supply text strings instead of precomputing text encodings. (Although for scaling purposes, you will definitely want to precompute the textual embeddings + mask)

Features

  • The number of textual captions must match the batch size of the images
  • With the ImagenTrainer wrapper class, the exponential moving averages for all of the U-nets in the cascading DDPM will be automatically taken care of when calling update
  • You can also train Imagen without text (unconditional image generation)
  • You can also rely on the ImagenTrainer to automatically train off DataLoader instances
  • Generate an image with any text prompt using one command line
  • Imagen uses an algorithm called Classifier Free Guidance

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow Imagen - Pytorch

Imagen - Pytorch Web Site

You Might Also Like
Automated quote and proposal software for IT solution providers. | ConnectWise CPQ Icon
Automated quote and proposal software for IT solution providers. | ConnectWise CPQ

Create IT quote templates, automate workflows, add integrations & price catalogs to save time & reduce errors on manual data entry & updates.

ConnectWise CPQ, formerly ConnectWise Sell, is a professional quote and proposal automation software for IT solution providers. ConnectWise CPQ offers a wide range of tools that enables IT solution providers to save time, quote more, and win big. Top features include professional quote or proposal templates, product catalog and sourcing, workflow automation, sales reporting, and integrations with best-in-breed solutions like Cisco, Dell, HP, and Salesforce.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Imagen - Pytorch!

Additional Project Details

Programming Language

Python

Related Categories

Python Networking Software, Python Machine Learning Software, Python Neural Network Libraries

Registered

2022-08-02