Deep Daze is a simple command-line tool for text-to-image generation using OpenAI's CLIP and SIREN (an implicit neural representation network).

In true deep learning fashion, more layers yield better results. The default is 16 layers, which can be increased to 32 depending on your resources.

The generator network can be primed with a starting image before it is steered towards the text, a technique first devised and shared by Mario Klingemann. Specify the path to the image you wish to use and, optionally, the number of initial training steps.

An image can also be supplied as the optimization goal, instead of only priming the generator network. Deep Daze then renders its own interpretation of that image.

The regular mode accepts text of at most 77 tokens. To visualize a full story, paragraph, song, or poem, set create_story to True.
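
To give a feel for how a text longer than the 77-token limit might be handled, here is a minimal sketch that cuts a long text into overlapping windows, each short enough to encode on its own. This is an illustration under stated assumptions, not Deep Daze's actual implementation. The function name `sliding_windows` and its parameters `max_words` and `stride` are hypothetical. It also counts whitespace-separated words, whereas CLIP counts byte-pair tokens, so a real limit would be reached sooner than 77 words.

```python
def sliding_windows(text, max_words=77, stride=1):
    """Split text into overlapping windows of at most max_words words.

    Hypothetical helper for illustration only: words stand in for CLIP
    tokens, which are byte-pair encoded and therefore more numerous.
    """
    if max_words < 1 or stride < 1:
        raise ValueError("max_words and stride must be positive")

    words = text.split()
    if not words:
        return []
    if len(words) <= max_words:
        # Short texts fit in a single window.
        return [" ".join(words)]

    # Start positions advance by `stride`; the final window is forced to
    # end on the last word so the tail of the text is never dropped.
    last_start = len(words) - max_words
    starts = list(range(0, last_start + 1, stride))
    if starts[-1] != last_start:
        starts.append(last_start)

    return [" ".join(words[s:s + max_words]) for s in starts]


if __name__ == "__main__":
    poem = "the quick brown fox jumps over the lazy dog"
    for window in sliding_windows(poem, max_words=4, stride=2):
        print(window)
```

Each window could then serve as the text target for one stage of the optimization, so the image drifts through the story as the window advances. A smaller stride gives smoother transitions at the cost of more stages.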

Features

  • Requires an Nvidia or AMD GPU
  • Recommended: 16GB VRAM
  • Minimum: 4GB VRAM
  • Runs on Windows
  • Names output files with both a timestamp and a sequence number
  • Can optimize toward its own interpretation of a supplied image

License

MIT License

Additional Project Details

Operating Systems

Windows

Programming Language

Python

Related Categories

Python Terminals, Python Command Line Tools, Python AI Image Generators, Python Deep Learning Frameworks, Python Generative AI

Registered

2022-02-01