Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
...Therefore, we need the loss to propagate back from the VAE's encoder part too, which introduces extra time costs in training. We use the multi-resolution grid encoder to implement the NeRF backbone (implementation from torch-ngp), which enables much faster rendering.