TRELLIS 2
Native and Compact Structured Latents for 3D Generation
...At its core is a novel sparse voxel structure called O-Voxel that jointly encodes both geometry and surface appearance, enabling reconstruction and generation of complex 3D shapes with arbitrary topology, open surfaces, and physically based rendering (PBR) textures. The system leverages a large 4-billion-parameter architecture combining sparse 3D variational autoencoders with flow-matching transformers to produce fully textured 3D models at resolutions up to 1536³ voxels. TRELLIS.2 emphasizes speed and compact latent representation, allowing bidirectional conversion between mesh formats and internal representations with minimal preprocessing and optimized performance on high-end GPUs.