Recovering the Visual Space from Any Views
Contexts Optical Compression
Diffusion Transformer with Fine-Grained Chinese Understanding
Official implementation of DreamCraft3D
Sharp Monocular Metric Depth in Less Than a Second
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large language model and vision-language model based on linear attention
AI Suite for upscaling, interpolating & restoring images/videos