Implementation of NWT, audio-to-video generation, in Pytorch. The paper proposes a new discrete latent representation named Memcodes, which can be succinctly described as a type of multi-head hard-attention to learned memory (codebook) key/values. They claim the need for less codes and smaller codebook dimensions in order to achieve better reconstructions.
Features
- Implementation of NWT
- Audio-to-video generation
- For Pytorch
- Multi-head hard-attention to learned memory (codebook) key / values
- Smaller codebook dimension
- Achieve better reconstructions
License
MIT LicenseFollow NWT - Pytorch (wip)
Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit
Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of NWT - Pytorch (wip)!