Implementation of NWT, an audio-to-video generation model, in PyTorch. The paper proposes a new discrete latent representation named Memcodes, which can be succinctly described as a type of multi-head hard attention to learned memory (codebook) keys/values. The authors claim that Memcodes need fewer codes and a smaller codebook dimension to achieve better reconstructions.

Features

  • Implementation of NWT
  • Audio-to-video generation
  • Built on PyTorch
  • Memcodes: multi-head hard attention to learned memory (codebook) keys/values
  • Fewer codes and a smaller codebook dimension
  • Better reconstructions, as claimed by the paper
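
To make the "multi-head hard attention to learned memory" description concrete, here is a minimal forward-pass sketch. It is not taken from the project's code: the function name memcodes_lookup, the tensor shapes, the dot-product similarity, and the argmax selection are all assumptions chosen for illustration. Each head scores its slice of the input against its own codebook keys, picks the single best-matching code, and returns that code's value.

```python
import torch
import torch.nn.functional as F


def memcodes_lookup(x, keys, values):
    """Hypothetical Memcodes-style lookup (illustrative, not the project's API).

    x:      (batch, seq, heads * dim_head) continuous features
    keys:   (heads, num_codes, dim_head)   learned codebook keys
    values: (heads, num_codes, dim_head)   learned codebook values
    Returns the quantized features and the chosen code index per head.
    """
    heads, num_codes, dim_head = keys.shape
    b, n, _ = x.shape

    # Split the feature dimension into heads: (batch, heads, seq, dim_head).
    q = x.reshape(b, n, heads, dim_head).transpose(1, 2)

    # Similarity of every query to every key of its own head.
    logits = torch.einsum('bhnd,hcd->bhnc', q, keys)

    # Hard attention: keep only the single best code per head and position.
    indices = logits.argmax(dim=-1)                            # (batch, heads, seq)
    one_hot = F.one_hot(indices, num_codes).to(values.dtype)   # (batch, heads, seq, codes)

    # Read out the value of the selected code, then merge the heads back.
    out = torch.einsum('bhnc,hcd->bhnd', one_hot, values)
    out = out.transpose(1, 2).reshape(b, n, heads * dim_head)
    return out, indices


# Example: 4 heads, 16 codes per head, 8 dimensions per head (assumed sizes).
x = torch.randn(2, 5, 32)
keys = torch.randn(4, 16, 8)
values = torch.randn(4, 16, 8)
out, idx = memcodes_lookup(x, keys, values)
print(out.shape, idx.shape)
```

With several heads, the number of distinct combined codes grows as num_codes to the power of heads, which is one way to read the claim that fewer codes and a smaller codebook dimension can suffice. The argmax here is not differentiable, so training would need some gradient estimator that this sketch leaves out.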

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python AI Video Generators, Python Generative AI

Registered

2023-03-22